Twistless KAM tori, quasi flat homoclinic intersections, and other cancellations in the pertur- 
bation series of certain completely integrable hamiltonian systems. A review.* 
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Abstract: Rotators interacting with a pendulum via small, velocity independent, potentials are considered. 
If the interaction potential does not depend on the pendulum position then the pendulum and the rotators 
are decoupled and we study the invariant tori of the rotators system at fixed rotation numbers: we exhibit 
cancellations, to all orders of perturbation theory, that allow proving the stability and analyticity of the 
dipohantine tori. We find in this way a proof of the KAM theorem by direct bounds of the k-th order 
coefEcient of the perturbation expansion of the parametric equations of the tori in terms of their average 
anomalies: this extends SiegeVs approach, from the linearization of analytic maps to the KAM theory; the 
convergence radius does not depend, in this case, on the twist strength, which could even vanish ("twistless 
KAM tori"). The same ideas apply to the case in which the potential couples the pendulum and the rotators: 
in this case the invariant tori with diophantine rotation numbers are unstable and have stable and unstable 
manifolds ("whiskers"): instead of studying the perturbation theory of the invariant tori we look for the 
cancellations that must be present because the homoclinic intersections of the whiskers are "quasi flat", if 
the rotation velocity of the quasi periodic motion on the tori is large. We rederive in this way the result that, 
under suitable conditions, the homoclinic splitting is smaller than any power in the period of the forcing and 
hnd the exact asymptotics in the two dimensional cases (e.g. in the case of a periodically forced pendulum). 
The technique can be applied to study other quantities: we mention, as another example, the homoclinic 
scattering phase shifls. 
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§1 Introduction 

We discuss the invariant tori and the spUtting of their homochnic stable and unstable manifolds for a 
special class of quasi integrable hamiltonian systems. We apply and extend, in the considered class, the 
important ideas of Melnikov and Eliasson respectively on the theory of low dimensional invariant tori and 
their manifolds, [Me], and on the cancellations behind the convergence of the formal perturbation series for 
the invariant tori of maximal dimension, [E], i.e. for the KAM tori ([K],[A2],[M]). We also point out the 
analogy of the methods with those used in quantum field theory, particularly in the renormalization group 
approaches, [G2]. 

The ideas of Melnikov and Eliasson have been around since quite a while: but it seems that few realized their 
importance; a possible explanation is that the original papers are plagued by excessive generality. Assuming 
that this is the reason it is desirable to show what they imply in a "simple" case. 

There are, however, no really simple cases (as it is well known). 

In his book on mathematical physics Thirring made an attempt to find a "simple" model to explain the 
KAM theory, [T]. Although his models are essentially as difficult to treat as the most general hamiltonian 
system with twist, some aspects of the proofs are somewhat simpler and leave us with the feeling that 
something more can still be done. The class of models consists of a family of rotators (i.e. points on circles 
or, if one deems this too abstract, cylinders with fixed axis), say / — 1 in number, interacting with a pendulum 
via a conservative force. The inertia moments Jj, j = 1, . . . , ^ — 1, of the rotators form a matrix J which is 
diagonal. And they are supposed to be Jj > Jo > 0, if Jq is the inertia of the pendulum. Actually Thirring 
has no pendulum, just a system of rotators: this can be viewed as a special case in which the pendulum and 
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the rotators do not interact. Also he considers a somewhat more general model than model 1), by allowing 

the matrix J to depend on a. 

More formally: we shall consider the I degrees of freedom hamiltonian = Hq + /u/ given by one of the 
following two expressions: 



1) W-A+-J ^A-A + ij, ^ fpcosa-u 

0<\v\<N 




where (/,</?) G cJ) G i?^*-' are canonically conjugated variables, uj <E R!' v= {n,v) G Z'- , \v\ = 

|n| + = \n\ + and J, Jo > (respectively, "rotator's inertia moment" and "pendulum inertia 

moment"), g > {) [g^ is the "gravity"), are fixed constants. And w, /z are parameters. 
To avoid trivial, or lengthy, comments here and there wc suppose that J > Jq with Jo > fixed forever, 
setting a scale for the size of the inertia moments, (we do not take it 1 as we are convinced that it is better 
not to fix units of measure), and that = or /^^ g = 0, for all n, (for what concerns the trivial comments), 
and l/jy] < Jqw^ (for the lengthy ones): this will be clearly not restrictive. 

Model 2) will be called rotator-pendulum model, or simple resonance model (see below for motivation) or 
Arnold model, and model 1) will be called rotator model or Thirring model: they deserve a name, because 
they are a very interesting class with many properties that cannot be discussed here, the analysis of a few 
(non trivial) of which was begun in [A] for the first and in [T] for the second. 

Model 1) has many applications in plasma physics, and celestial mechanics. Model 2) has great relevance 
for the analysis of the breakdown of invariant tori and the corresponding universal behaviour. 
In this paper we suppose a priori that: 

Hypothesis H: the parameters uj, n verify, in general: 

^=^, |M|<6r;«, r?<l (1.2) 

with Q and which will be restricted to be large enough in the course of the analysis. In the case of model 
1) we shall also fix t] = 1,Q = 0. 

and: 



Hypothesis H': ujo is a diophantine vector, i.e. : 

|wo • i7| > ' for all6^i7G Z'-i (1.3) 

for some diophantine constant Co and some diophantine exponent r > 0. 

A natural energy scale for the model 1) will be -E = Jow^ and for model 2) one can take E = Jog^. 
We prove the following theorem for model 1): 

Theorem 1 (Twistless KAM): Consider the solution flow of the hamiltonian equations of the Thirring model 
1), under the assumptions H,H' above. Then there exists a constant b > Q and a function (dimensionless) 
x{^p ofip G r'~^, yu G R, holomorphic in p and tp , in the complex domains < e~^% and \ Im'^j] < ^, 
parameterized by ^> 0, and bounded by 1, such that the: 

_ -> E \ - ^ , , sin i7 • ■0 r „ \ ^ _ , , cos v ■ ih 

a = V -Hj}_^xpip) , A = -nE2_^Xff{ii) (1.4) 

with ip G T'~^, E = Jow^, are the parametric equations of an invariant torus (for the flow) on which the flow 
is quasi periodic with spectrum uj and the angles tp o-re its average anomalies: i.e. the flow is ip ^ ip +u}t, 
so that the angles tpj rotate at constant velocity ujj. The constant b depends only on Jq,uj'^ ,Cq,t, N,l; and 
Xj^i^i) — X-p{ii) denotes the Fourier transform of x{%p 
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Remarks: 

1) The fact that the theorem is uniform, in the size of = det J~^, provided J > Jo, means that it holds even 
if the twist = 0. This is the reason we cah the above theorem "twistless" KAM theorem. We prove it, 
under a slightly stronger diophantine hypothesis (see (7.1)), by finding a bound on the fc-th order of the 
/i -expansion of the function x: see §7 and Appendices A3,A4. 

2) If some or all the inertia moments Jj are +oo the above theeorem is an easy consequence of the classical 
KAM theorem (or better of its proof); see comments around (4.16) for the case Jj = + oo. However the 
boimd b obtained via the classical proof depends on the twist rate, i.e. on the maximum among the .Jj 
which are not +oo, and diverges as the rate approaches 0. This non uniformity is quite surprising, but it is 
an artifact of the classical proof (as a direct careful analysis of the latter also shows). 

Our second class of results concerns the simple resonance model, 2) in (1.1). 

The I = 2 and J = +oo case will not be excluded and corresponds to the "pendulum in a periodic force 
field"; if Z > 2 we take J < +oo, for simplicity, to be a constant but our results can be extended, essentially 
unchanged, to the case in which J is a diagonal matrix with the first element zero and the others positive 
(in fact one can take J to be a {I — 1) x (Z — 1) symmetric matrix with the first row and columns zero and 
with rank 1 — 2). This is also implicitly shown, in the framework of the classical proofs, by the proof of KAM 
theorem for model 1) developed in [T], if particularized to model 1): it yields results uniform in the twist 
rate. This is probably the case, as well, for the original proof of Kolmogorov, [K], if particularized to model 
l). 

Remark: To motivate the name of " simple resonance model" and hypothesis H one should recognize that 
model 2) presents the "basic" structure (after a few simple canonical changes of variables) of a perturbation 
of a completely integrable system, near a "simple resonance" (see [BG], [CG]). Suppose that the original 
(i.e. before the above mentioned change of variables) unperturbed hamiltonian is non degenerate, and 77, 
< 77 < 1, is a measure of the size of the original perturbation. Then, provided 77 is small enough, one finds 
that the parameters w and fi verify: lo = ujq rj^^^^, \tA ^ v'^" 1 where Qo > 1 can be prefixed at the beginning 
of the construction of the change of coordinates, . This is, perhaps, the strongest motivation to study the 
hamiltonians (1.1), model 2), with uj given proportional to l/^/rj. 

For 11 = 0, the hamiltonian equations generated by (1.1), {i.e. I = —d^Hij^, (p = djHi^, A = —dsHi^, 
a = d^Hn), admit {I — l)-dimensional invariant tori: 

To = {I = = ip}x{A = A°,aeT^-^} (1.5) 

possessing homoclinic stable and unstable manifolds, called "whiskers". The manifolds equations are: 

W^ = Wo = {£-^+g^Mcos^-l)=0}x{A = A° ,aeT^-'} (1.6) 

The integrability is reflected also by the degeneracy property that Wq = Wq . 

Then, it follows "from KAM theory", [Me],[E],[CG], that "many" imperturbcd tori aroimd the torus = 
(including the one A'^ = itself) can be continued analytically (in jj), togheter with their whiskers, into 
invariant tori with the same l3, for all \fi\ < brj'^ (if & is a suitable constant, explicitly computable in terms 
of a few parameters associated with i?o,./ in (1-1), sec for instancefCG]) and for Q large enough; we shall 
call such tori persistent. The determination of b, Q reqires going through an analysis very similar to that 
of the classical KAM theorem: hence we say that such tori and whiskers are "obtained by KAM analj^tic 
continuation".^. 

^ If one wants to get an idea of the kind of numbers that might be involved here (which is not logically necessary for reading the 

present paper) one can look at [CG] . It follows from [CG] that Q can be taken to be 10: see (5.76) of [CG] where Eo = 0(r;-i/2) 
and all other parameters are »;-independent; including Co, which is not related to the Co of the present paper because in [CG] 
the quoted inequality was obtained under the hypothesis that the constant bearing there the same name, besides being a bound 
on the diophantine constant for uj, was also larger than g^^ ■ In the present case the is of order 0(1), while the diophantine 
constant for oj is, by (1-2), (1.3), Cor}^^^ so that in applying the quoted result to our case we are forced to take the constant 
Co in (5.76) of [CG] of order 0(1). 

Note that in applying the results of [CG] to find an estimate for Q we apply the (5.76) and not the final result (5.90), which 
would give a more stringent condition {Q = 71), because we are using here only an intermediary result whose proof was 
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Remark: If / = 2, however, one does not need an elaborated method and a direct elementary check of the 
persistence of the invariant tori, which in this case arc periodic orbits, is possible, togheter with a reasonably 
straightforward estimate of b, Q. One finds, for instance, that for all J < +00 the value Q = 1 is sufficient, 
(hint: write the persistence condition as a fixed point equation for the Poincare map of the periodic orbit). 

In some applications the (1.2) is not a very strong condition because of the remark following (1.1) whereby 
the Qo mentioned there can be taken as large as wished, by finding suitable coordinates. In particular, if 
Qo = Q + 1 then the analyticity domain in fj, contains, well inside, the value r]'^° for rj small enough.^ 

We shall always suppose that /i is taken small enough so that the invariant torus that we are considering 
is persistent: i.e. we shall always suppose that < brj'^ where Q,b are the constants mentioned in the 
discussion following the hypothesis (1.2), so that the above persistence properties hold (the values of the 
constants can be determined in terms of properties of Ho,f in (1.1), see for instance [CG] where such 
estimates are obtained). 

We shall denote by Wj^ and 7^ the stable and unstable whiskers and, respectively, the whiskered tori 

obtained by the KAM analytic continuation (in /i) . The stable and unstable whiskers arc characterized 
by the fact that distance(S'*X^, 7^) exponentially fast as t ^ ±00; here S'* is the hamiltonian flow 
generated by (1.1). The flow on the persistent whiskered tori can be described, in suitable coordinates and 
for all small (i.e. < brj^), by ijj ^ i]j + uj't, for suitable diophantine vectors w' independent of jjL 
{e.g. Lo' = uj m the continuation of the unperturbed torus A = Q, I = = (p). 

A legacy of the unperturbed situation (see (1.6)) is that the persistent whiskers arising from (1.6) are, 
for small enough, graphs over the angles a e T'~^ and at least if \ip\ <2-k — 5 for any preflxed 5 > 0; 
hence they can be written as: 

= {(/, A, <p, a) = (/±(a, ^), A± («, ^), ^, a) : a e T'-\ \^\ < 27r - 5} (1.7) 

for suitable real analytic (in (a, ip) and /i) functions A^ , ■ It is also not difficult to check that the parity 
(in (a, (f) of (1.1) implies that {a, (f) = (0, tt) is a homoclinic point, i.e. (0, tt) G H W~ (see (4.18) below), 
for all (U small enough (so small that the above tori and whiskers can be proved to exist). 
In this context, it is natural to measure the splitting between W+ and W~ at <^ = tt and a = by the 
quantity: 

6{a)= detds{Al-A-)\^=^ (1.8) 

and its a-derivatives at a = 0. 

Using the theory normal forms ([N], see also [BG]), one can show, see [Nei], that if < br]'^ then 5 is smaller 
than any power in r] as rj ^ Q. 

Here we use an algorithm derived in [CG], (see also §2 below), for the computation of the /x-expansion 

coefficients of the functions A^ , . And we derive the above smallncss rcsiilt by explicitly checking several 
interesting cancellation mechanisms, operating to all orders of perturbation theory, and which are behind 
the smallness of the splitting 5 when /x < brj^ , see also [E] , [ACKR] . 

As a byproduct we obtain, and extend, the results of [Nei]. We obtain also the exact asymptotics in the 
cases I = 2, (with < J < +00), see §8: 

Theorem 2 (Quasi flat homoclinic intersections): Let I = 2, J > Jq, J < +00; let Nq be the degree in (p 
of the trigonometric polynomial f in (1.1); then, provided rj is small enough: 

m = ^p^A.e-^{l + 0{^)) (1.9) 

if < r]i with q>3No + 5, if A, = -5"'(^)^'^''"'(/iVo,i + /jVo,-i)^I^^, provided A, ^ 0. And in 
fact S{a) is a holomorphic function of a for | Imaj < ^, for all Q > if r} is small enough (depending on 
C). And: 

^^C^Jvl^lTae"^, |Ima|<C (1.10) 

completed under the condition (5.76). Such result is the persistence of a single invariant torus with given rotation vector oj: 

the paper [CG] was concerned with the persistence of a whole family of invariant tori, and subject to the condition of lying on 
a surface of prefixed energy (which led to add to (5.76) two further conditions, (5.81), (5.85) in [CG]). 

In any event the above bounds should not be taken too seriously from a quantitative viewpoint as they are very likely far from 
optimal (as no effort at all was devoted to obtaining numerically good bounds). 
e.g. Qo = 11 according to the just mentioned estimates in [CG]. 
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for a suitable > and for t] small enough (depending on q,C)- 
Remarks: 

1) The optimal result is probably q > Nq — ^. In Appendix Al we sketch how to get easily a better result: 
q> No + ^ instead of the pessimistic one q > SNq + 5 given above. 

2) For completeness we quickly rederive, in §2 and §4, the recursive formulae for the whiskers (see [CG], 
appendix 13). 

3) It will appear that our technique consists in representing the coefficient of the /x'^-th order contribution to 
S{d) as a time integral from t = — oo to t = +oo. The estimates should follow if analyticity properties of 
the integrands, allowing the shift of the t integration to a region where the integrand is small, could be 
checked. 

Such an idea has been present in the literature since a long time, and sometimes it led to errors. It is 
also behind the available studies of the asymptotic behaviour of 5 (a) as rj ^ (and jj, = r/'^ with Q large 
enough). 

In our representation of S{d) the above approach is not so straightforward: we cannot prove the holomorphy 
of the integrand in the integral representation of the k-th order coefficient of the ji expansion of 5{a), which 
in fact is not holomorphic in any useful region for the above idea to work literally, for k > 2. Rather we 
show that the; integrand can be written as a sum of a holomorphic term which is small. bc!(;ausc of an 
analyticity argument like the one mentioned above, and of a non holomorphic term which, however, is "as 
small as needed" if the bounds on the coefficients of the lower orders are "as small as needed" . Since the 
first order trivially has all the necessary properties, the result is obtained by induction. 

4) It is worth mentioning a related property, discovered in [CG] , that will not be needed nor discussed further 
in the present work but that is closely related to theorem 2. 

In many cases in which a hamiltonian like model 2) arises in the analysis of a resonant motion of a slightly 
pertubed completely integrable system, it happens that the original unperturbed hamiltonian is strongly 
degenerate, i.e. it does not depend on all the action variables (a feature common in celestial mechanics). 
In such cases, that we shall call "degenerate", the vector u in (1.1) has (at least) one component which is 
of O(v^). 

More physically one can say that the models 2) verifying (1.2), (1.3) contain two basic time scales, of order 
0(Ty^/^) ("fast") and 0(1) ("slow") corresponding to the frequences uj and g. But in the degenerate cases 
there are (at least) three time scales of order 0{r]^^^), 0{l) and 0(ry~^/^) ("secular"), as the very slow 
secular scale reflects that the perturbation of the original system removes its degeneracy to order 0{ri), 
relative to the w's size. 

It is remarkable that a second order computation along the lines discussed below, shows that generically 
the splitting 6, with fi = tj'^ , is of 0{i]^'') for some M > 0: hence it is not smaller than any power in ry. 
The latter fact can be used to show, at least in some cases, the existence of Arnold's drift and diffusion 
for any > small enough for systems which are small perturbations of degenerate completely integrable 
systems, see [CG]. 

The loiter problem is still completely open in the non degenerate case and probably requires some really new 
ideas for its solution. 

5) The proof that we give in the case I = 2 and in the general case have different nature: in both cases we 
use explicitly some general results on the existence of {I — l)-dimensional tori, [M], in the form derived in 
[CG]. However we shall show that the I = 2 case is not really relying on such results. We think that also 
the I > 2 cases can be freed from its dependence on [CG]: the reason is the validity of theorem 1 and its 
proof: it should be possible to carry out a similar proof also for theorem 2, if Z > 2. 



§2 Recursive formulae 

In this section we recall some basic facts from the KAM theory mentioned in the introduction and we derive 
from such facts simple recursive formulae for the functions I^, in (1.7) and their time evolution. 

Let us consider, for concreteness, the jj. dependence of the above mentioned (see comments following (1.6)) 
analytic continuation in the parameter fj, of the unperturbed whiskers having = 0. The unperturbed 
motion is simply: 

XO(f) ^(/0(t), 0,(^0(0, a + u;t) (2.1) 
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where {I^{t), </'°(t)) is the separatrix motion, generated by the pendulum in (1.1) starting at, say, ip = w. If 
we call P(I, tp) the impcrturbcd pendulum energy in (1.1), it is P(/", Lp'^) = (sec (1.6)). 
As mentioned in the introduction, under the hypotheses H,H' (see §1) the unperturbed whiskers = 
Wo persist for \n\ < /io = brj'^ and can be analytically continued into whiskers Wj^. 

Let X^{t;a), ct = ±, be the evolution, imdcr the flow generated by (1.1), of the point on given by 

(/^(a, 7r), A'^{a, tt), tt, a) (sec (1.7); from now on we shall fix = tt, which amounts to studying the whiskers 
at the "Poincare section" {(^ = tt}). 

The mentioned analyticity in fi (sec [CG], for instance, §5, §6) allows us to consider the Taylor series 
expansions of the whiskers equations; let: 

X-(i)=X;(t;a) = ^X*=-(t;a)/ = ^X'=-(i)/, a = ± (2.2) 

k>0 k>0 

be the power series in fi of X^, (convergent for /i small); note that X'^"' = X'^ is the unperturbed whisker. 
We shall often not write explicitly the a variable among the arguments of various a dependent functions, to 
simplify the notations. 

Prom KAM theory, it follows that the t-dependence of X^{t) has the form: 

X;{t) = X;{wt, t) = X^iCjt, t; a) (2.3) 

where X'^{ip, t; a) is a real analytic function, of all its arguments (/x included), which is periodic in ^ and a; 
furthermore it has a holomorphy domain: 

V = V^,^„,K,^„ = {\lmi;,\<^, |Ima,|<eo, \Imt\ < Kg-\ < br^Q} (2.4) 

where ^, K, b, Q are suitable positive parameters. And on V the following bound holds (see (6.28) in [CG], 
and [CG] for a complete proof): 

supp |X;(^,t;a) -X-(V^,aoo;a)| < De-\^'st\.^ sup^ |X-(V^, cS)| < D (2.5) 

where a = sign(Rei) and |X| = + + (Jo5)-^(|X+| + 

In particular, if X^" {%l> ^t: a) is the k^^ Taylor coefficient (in the expansion) of the function X^{ip,t;a) 
and if /xq = br]'^ , one has for all A: > 0: 



sup\X'"'{if,t;a)-X''''{if,aoo;a)\ < £) ^''s*!'^, fJ,o = brj'^ 



s\iv\X'"'{il),t,a)\ < D 



(2.6) 



and X'^°'{t) in (2.2) is recognized to coincide with X'^'^(Ljt,t:d). 
We number the components of X with a label j , j = 0, . . . ,21 — 1, with the convention that: 

Xo=X-, (Xj)j=i,...,;_i = X^, Xi = X+, {Xj)j=i^,,,^2i-i = Xf (2.7) 

i.e. we write first the angle and then the action com,ponents; first the pendulum, and then the rotaiors. 
Inserting (2.2) into the Hamilton equation associated with (1.1), we see that the coefficients X'"^{t) = 
X'"^{L0t,t;d) satisfy the hierarchy of equations: 

d^ka = -^ka ^ ^-^ka ^ pka (2.8) 

dt ^ ' 



where: 



6 Jq^ 6 \ I 





^ = ^(*^- I 5Vosin^O(t) \ I' 




-5^/((pO(f),cf + c5t) 
V-95/(<^°(i),a + a;0y 



(2.9) 





and where F^'' depends upon X'^ , ...,X'^^^"' but not on X*^""; here (as everywhere else) the arrows denote 
(/ — l)-vectors. Note that the entries of the {21 x 21) matrix L have different meaning according to their 



position: the O's in the first and third row are {I — 1) (row) vectors, the O's in the first and third column are 

{I — 1) (column) vectors, and (even more confusing) the O's and in the second and fourth column are 
{I — 1) X {I — 1) matrices, while the O's in the first and third columns are scalars. 

More explicitly, and more generally, if we write the Hamilton equations for X = X^^{t), with hamiltonian 
(1.1), as: 

X ^ Go{X) + nG{X) (2.10) 

{i.e. Go = EdHo, G = Edf, E = standard symplectic matrix, d = {dj, d^, 9^, ds)) then, by Taylor expan- 
sion, we can rewrite (2.8) for fc > 1 as: 

21 21-1 rrii 

X^'^=J2id,Gor){X'it))Xf''+J2iGorU{X°{t)) J2 nn^'''+ 



21-1 nii 

|m|>0 (fc*)«,fe-i '=0 J = i 



(2.11) 



where we have used the notation (fc* > 1, mj > 0): 

(Q) ^( ) = ( Ai A,_i f ai ai-i \ , , 

^ ^"^^ V milmslms! ...m;+2! ...ms;! ^ (2.12) 

{kj)m,p = {kl, . . . ,kl^^,kl, . . . ,k^^, . . . jkf, . . . ,k'^^J s.t. k] = p 

and r = 1, ...,2/, and the case k = 1 requires a convenient interpretation (that the reader can easily work 
out, as this is the very easiest case, and as the result must be consistent with (2.8)). 

Expression (2.11) gives in particular a formula for F'^'^ in terms of the coefficients X", X''~^'^ and of the 
derivatives of ifo and /. Therefore, denoting, as above, by indices +, t)— li the components I,A,ip,a, we 
see that, in our special case, (2.8) takes the particularly simple form: 



= (g^Jo sin <p°)X'l'' + F^" , ^Xi" = Ft" 
at at ' ' 

-X_ ~Jo X^ , -X^ -J X^ 



(2.13) 



as F^'', F^" vanish identically, for fc > 1. And if X_ = Xq, for aU A; > 1 it is: 



-1 rui 

j-k^a 



F^- = 0, = 0, F!^" = -Y {daf)rn{^\^t) ^ JJ 11 (^.14) 

|™|>o (fc*U.fe-i»=Oj=i 

m l — l ri ij 

^ x^(5^josin^)^(/)^n^-'- E(^-/)-(^°''^*) E nn^''^ 

|m|>2 {ki)kj=l |m|>0 (^k*Uk_,i=Oj=l 

where {kj)^^k, {kj)m,k-i (are defined in (2.12)). Note that the first sum in the expression for can only 
involve vectors m with = if j > 1, because the function Jog^cosip depends only on and not on a, 
(hence also fcj = if i > 0). We use here the above notation to uniformize the notations. 
The evolution of X'' is determined by integrating (2.13), if the initial data are known. The k = 1 case 
requires a suitable interpretation of the symbols, which we leave to the reader (and the result has to be 
(2.9),(2.8)). 

We recall that the wronskian matrix W{t) of a solution t ^ x{t) of a differential equation x = f{x) in i?" 
is a n X n matrix whose columns are formed by n linearly independent solutions of the linear differential 
equation obtained by linearizing / around the solution x and assuming W{0) = identity. 

The sohibility by elementary quadrature of the free pendulum equations on the separatrix (see for instance 
(3.3) below), leads after a well known classical calculation to the following expression for the wronskian 
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W{t) of the separatrix motion of the pendulum appearing in (1.1), with initial data at t = given by 
<fi = iT,I = 2gJo: 

And the evolution of the ± (i.e. /, (p) components can be determined by using the above wronskian: 

{^t) =Wit) +Wit) fw-\r) (^,."(^^) dr (2.16) 

Thus, denoting by Wij {i,j = 0,l) the entries of W we see immediately that: 

Xl^t) = «;H(t)Xf (0) + wn{t) f woo{t)F'1^ {T)dT - wio{t) f woi{t)FI'' {t) dr 

Jo Jo 

Xl'^it) = woi{t)Xmo) + woiit) I woo{T)Fl%T)dT - woo{t) I woi{t)FI'' {t) dr 

Jo Jo 

The integration of the equations (2.13) for the t, J. components is "easier" yielding: 

Xf'^(i) =Xf'^(0) + / Fl^''{T)dT 
Jo 

[f''{t) =J-i (iXf"(0) + / dT{t- t)F!^''{t)^ 



(2.17) 



X 



(2.18) 



having used that the X'^" = because the initial datum is fixed and fj, independent; and (2.17), (2.18) can 
be used to find a reasonably simple algorithm to represent the whiskers equations to all orders fc > 1 of the 
perturbation expansion. 

We shall regard the two functions X'^'^(f), as forming a single function X''{t): 

X^(t) = |^r!*) if^ = «!g^* = + (2.19) 
^' \X''-{t) ifcr = signi = - ^ ^ 

and at this point it is useful to open a parenthesis to define some integration operations that can be performed 
on such functions; such operations arise as soon as one tries to determine the initial data (still unknown) in 
(2.17),(2.18). 



§3 The improper integration X. 

The operation is simply the integration over t from aoo to t, a = sign*. In general such operation cannot 

be defined as an ordinary integral of a summablc function, because the functions on which it has to operate 
(typically the integrands in (2. 17), (2. 18)) do not, in general, tend to as t ^ oo. 

But the simplicity of the initial hamiltonian has the consequence that the functions X''{t), and the matrix 
elements Wij in (2.15), belong to a very special class of analytic functions on which the integration operations 
that we need can be given a meaning. 

To describe such class we introduce various spaces of functions; all of them are subspaces of the space M. 
of the functions of t defined as follows. 

Definition 1: Let M. he the space of the functions oft which can he represented, for some k>0, as: 

M{t) = J2 ^-^^MJ{x, cot) , x= e-'^s* , a = sign* (3.1) 

3=0 ^' 

with M^{x,ijj) a trigonometric polynomial in -0 with coefficients holom,orphic in the x-plane in the annulus 
< |x| < 1, with: 1) possible singularities, outside the open unit disk, in a closed cone centered at the origin. 
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with axis of symmetry on the imaginary axis and half opening < ^; 2) possible polar singularities at x = 0; 
3) Mk 7^ 0. The number k will be called the t-degree of M. The smallest cone containing the singularities 

will be called the singularity cone of M . 

It is not difficult to see that if a function admits a representation like (3.1), with the above properties, then 
such a representation is unique, (see [CG], eq. (10.15), note that in [CG] M. is called A4). 
It is convenient to consider the functions of t defined by the monomials: 

(E^x'^e''^-^* (3.2) 

where x = 0, 1 and we use the notations in (3.1). We remark that the functions in M can be expanded in 
the above monomials {i.e. the (3.2) "span" the space M.). The functions M G Xi such that the residuum at 
a; = of x~^{Mj{x, •)) is zero, (here the average is over tp , i.e. it is an "angle average"), form a subspace 
A^o of M. 

The functions in M,Mo are not bounded near a; = 0, in general. We denote M and Mo the respective 
subspaces of the functions bounded near x = 0: which means that the Mj have no pole at a; = and, 

furthermore, that Mj{0, tp) = ii j > 0. 

The symbols M'' , Mq , M'' , Mg will denote the subspaces of M,Mq,M,Mo, respectively, containing the 
functions of t-degree < k. 

Note that M E A4 can be written a.s M = P + M' with P being a polynomial in at (with a dependent 
coefficients) and with M' £ Mo- this can be done in only one way and we call P the "polynomial component" 
of M. Likewise M G M can be written as M = p + M' with p being a constant function (with constant 
value depending on a) and M' G Mq: p will be called the "constant component" of M. In both cases M' 
will be the "non singular" component of M. 

The coefficients of the above mentioned expansions and polynomials depend on cr = ±, i.e. each M G M 
is, in general, a pair of functions M'^ defined and holomorphic for t > and t < 0, respectively (and, more 
specifically, in a domain including {aRet > 0, \ lmgt\ < 7r/2}). 

The functions M'^{t) might sometimes (as in our cases below) be continued analytically in t but in general 

M+(— t) 7^ M^{—t) even when it makes sense (by analytic continuation) to ask whether equality holds. For 
the purpose of comparison with [CG] we note that the only spaces introduced there are M and Mo, and in 
[CG] they are called M and Mo respectively. 

Note however that ii M E M the points with Ret = and |Imt| < 7r/{2g) {gt = ±«7r/2 corresponds to 
x = =F*) are, (by our hypothesis on the location of the singularities of the Mj functions), regularity points 
so that the values at t^, "to the right" and "to the left" of t, will be regarded as well defined and given by 

M(t±) = limt,^t, Rct'^Rcti M{t'), in particular A/f±(0±) = M^{l-,6). 
It must be remarked (as this will be essential later) that, since (^°(t) = 4arctane~^*: 

cos^° = 1-^35^^ = 1 -8^, sin^^ = 2j^^=4ax^ (3.3) 

and since / in (1.1) is a trigonometric polynomial, the function F^, see (2.9), belongs to M and, in fact, the 
component belongs to Mq (as accidentally does F^, as well). 

In general if a function M E M i^i holomorphic in t in the strip | Im ytj < j and it has a given parity, then 
it follows that: M+(a;) = ±(-1)^^(3;-^), where the sign is + if M is even, and — if it is odd. This means 
that a^Mj have the same parity for x< — >x~^ as M for t< — > — t. Interesting examples are the functions in 
(3.3). 

It will be checked, by induction, that the functions X'^ and F*^ are in M and their representation (3.1) is 
such that the sum over j runs up to 2k. We say that F'',X'' e M'^'^, see definition 1. More precisely: 

X^eTW^'^-i, F'=G7W2('=-i), F^^gMI^''-^^ (3.4) 

and, furthermore, the singularity cone consists of just the imaginary axis {i.e. the singularities of the functions 
defining X'^, F'^ are on the segments on the imaginary axis {—ioo, —i]^ and +ioo)). See below for details. 
On the class M we can define the following operation. If M S A^, and t = t + i'&, with r, 1? real, and 
T = Het ^ 0, a = sign Ret, the function: 

lRM{t)= I e-^s''''M''{z)dz (3.5) 

J <TOO-\-i'0 
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is defined for Rei? > and large enough, the integral being on an axis parallel to the real axis. 

It M E Ai then the function of R in (3.5) admits an analytic continuation to Rci? < with possible poles 
at the integer values of R and at the values iu) ■ v with \v\ < (trigonometric degree of M in the angles 
and we can then set: 

TM{t)^ j^TnM{t) (3.6) 

where the integral is over a small circle of radius r < 1 and r < min |w • i7|, with the minimum being taken 
over the v which appear in the Fourier expansion of M (which is finite by definition 1). 
From the above definition it follows immediately that if M(t) = P then: 

IM{t) = ^P+^ (3.7) 
and more generally, if j, h are integers and x = 0, 1, the I acts on the monomial (3.2) as: 

^ -Vx+i(£g^ otherwise ^ ' 

showing, in particular, that the radius of convergence in x of TM, for a general M, is the same of that of 
M. But in general the singularities at ±i will no longer be polar, even if those of the Mj's were such. 
In fact one might want to check that IM e M.: this certainly happens if no singularity can appear in 
the {XM)j''s outside the singularity cone of M. Note that M & M. can be written, for fixed a = sign Ret, 
as a sum of finitely many monomials Px~'^e'^^* with r > plus a function M'{t) of the form (3.1) with 
Mj(0,V") = 0. 

If the T operates on the monomials with r > (i.e. on monomials not vanishing at a; = or with a polar 
singulariy at the origin) then it can be explicitly computed by using (3.8), and the check of the claimed 

property is immediate. 

The IM for M such that A'/j(0,i/') = can be computed as an ordinary integral: it is obvious from (3.5) 
that in such case the Xft,M{t) is holomorphic in R for i? = (as the integral in (3.5) is convergent for R = 0). 
Consider first the case in which M{t) = e'"*M'^(a;) with M'^{x) in M and M<^(0) = 0, one gets for all n 
real: 

Jt^ ^inTj^(^^-agT-^ ^ g-iOag-i log ^ ^ ^iQt ^-iQag-^ ^ozVM{yx)^ (3.9) 

which is clearly holomorphic in x in the above considered x plane deprived, outside the unit disk, of the 

singularity cone of M. The more general case, in which we consider Pe'^^*M{x) with Af(0) = 0, is derived 
from (3.9) by differentiation with respect to il. Any other case is a finite linear combination of the considered 
cases. 

Furthermore, if M{t) G Ai and if sign Rei = sign Reto) it follows from (3.8) (as declared at the beginning 
of §3) that the IM function is a primitive of M: 

[ M{T)dT=IM{t)-IM{to) (3.10) 

J to 

In general, I : Ai'' M^^^ , because of (3.7). Furthermore, because of the similarities of the I operation 
with a definite integral, we shall often use the notation: 

/ M(r)rfr = IM{t) , M G M, a = sign Rei (3.11) 

J {a) 

In fact many standard properties of integration are, in such a way, extended to the space M.; for instance: 

/t pO^ pt pt f-T f-t 

= f + , X'^f f =f (t-r) (3.12) 
(a) J {a) Jo J (cr) J (a) J (cr) 

where a = signt and^°^^ means limg.^o'' , of course. 
This leads to a few more natural definitions and properties. 
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If at = sign Ret, if M G M. and if R is large enough, we define I±^rM via: 



I±,RM{t)= [ M(T)e-"<^-^srfr = JijM(0=^)+ / M(r)e-^'^*^3(iT (3.13) 

J±oo Jo 

where we use the definition (3.5), and in the r.h.s. the integral is on the straight line joining to t. This 
allows us to consider the analytic continuation in R of R~^I±^RM{t) and its residue l±M{t) at ii = 0. The 
latter is hnked to the operation T ahcady defined in (3.6) so that the following two definitions (of T±M{t) 
and the consequent one of^_'^M{T)dT) are natural: 



I±M{t)=-f- M{T)dT =IM{0^) + f M{T)dT 
M(r)dr =I_M{t) - I+M{t) 



f 

where the integral from to i is over a straight path joining with t. The (3.14) will be a quite useful 
extension of the operation 1 introduced in (3.11). 

Given the definition (3.14), a natural question arises at this point: is there a class of functions M G A4 
such that the following shift of contour formula: 

/+C30 P+OQ 
M{T)dT=j- M{T + ig-^C)dT (3.15) 
— CO J —00 

holds for all ^ smaller than the complement to ^ of the half opening of the singularity cone of M [i.e. for 
all ^ for which (3.15) makes sense)? A very simple answer follows immediately from the definitions: (3.15) 
holds if M is holomorphic as a function of tin a strip around the real axis, wider than g~^£,. Note that this 
might be at first surprising as the operation -f is an improper integral operating on generally non summable 
functions. 

The (3.15) can be proved by remarking that for R large (and positive): 

e^^^M{T)dT+ / e-^''^M{T)dT (3.16) 
-00 Jo 

which, by the assumed analyticity of M, differs from: 

/O /■+00 
e^''3M{T + ig-^0dT+ / e-^''<^M{T + ig-^^)dT (3.17) 
-00 ^0 

precisely by: 

/O r+00 
dre^'^mir + ig'^S) + {e''^^ - 1) / dTe-^''^M{T + ig-'^^)+ 
-00 Jo 

-i / (e-"'^s - e+-^'^s)M(ir) rfr (3.18) 
Jo 

i.e. (3.16) is the sum of (3.17) and (3.18), by the analyticity properties of M. This implies (3.15) by taking 
the residues at i? = 0, as the analytic continuation of (3.18) vanishes if — > 0, when M e Xio, i.e. it has 
no polynomial component; while if M is a polynomial in t one can check by direct calculation (using (3.7)) 
that both sides of (3.15) give 0. 

It is also useful for the purposes of a better understanding, to realize that, if M is holomorphic as a function 
of f in a strip wider than g~^^, then for t real it is: 



^E/ ^ ^^''^'^^ = f y^e-«-«W^«)M(r + i5-^0rfr (3.19) 
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where the integrals in the r.h.s. have to be considered to be the analytic continuation on R from R> and 

large. This is so because the two sides differ by the residue at = of —iR~^ ^ dT{e~^^^ - e^^^)M{iT) 
which vanishes. 

Note that the r.h.s. is different from ^ '^p'f p^M{t + ig~^S,)dT, which would be the residue ar i? = 

of R~^ J2p /poo e~^^'^^^ M{t + ig~^0 dr. The two quantities coincide under the mentioned analyticity 
assumption, however, if M is in Ado', and one could verify that this remains true if M has no polynomial 
component, i.e. M £ Mq. 

This completes the discussion of the operations I. 



§4 Analytic expressions of the expansion coefficients for the whiskers and the KAM tori. 
Parity properties. 

A): Whiskers. 

We shall show that X'^ admits rather simple expressions in terms of the operation I, (and other related 
operations introduced below). Recall that in §2 we have fixed a G T'~^ and ip = it, and we are looking for 
the motions, on the stable (cr = +) or unstable {a = — ) whisker, which start with the given a and tp = n at 
t = 0; in the following a is kept constant and usually notationally omitted. 

We suppose inductively that e A^2(fc-i)^ ph g ^ ^ ^-^^^ ^ha g _A42fc-i^ f^j. ^ ^. 

see definition 1, §3. This means, in particular, that F^'^,X^'^ can be represented as: 

F'^-(ar,^,i) 
X''-(ar,^,i) 

by setting ^ = cot, a = signt, x = e~^^*; with F^'^^X'-" holomorphic at a; = and vanishing at a; = if 

j > 0. Hnece if x = e~^°'* and ip is kept fixed the Fj'^Xj tend exponentially to zero as i ^ oo, if j > 0; 

while if j = they tend exponentially fast to a limit as i ^ croo {i.e. as a; ^ 0), which we denote F'^{'ip , aoo) 
dropping the subscript as there is no ambiguity. 
Furthermore the inductive hypothesis is enriched by: 

F^'^ {(too) = 6, for all /i< A; (4.2) 

recalling that, in general, a subscript u affixed to a function denotes the Fourier component of order S 
of the considered function. 

We denote X^S{t) and F^S{t) the Fourier transforms in of X^''{t, if) and -Fj"'(t, if). It follows from the 

KAM theory mentioned in (2.6), that X^°'{t) and, from (2.14), hence also F^''{t) are bounded as t aoo 
for all h, so that X^'^(0, V") = if j > 1. We show that the latter information is very strong and permits us 
to determine X'^. 

We note that, since F^'' e A^2(fc-i) pka ^ 5 j^oj^j^ ^j^g function Xl^^'^t), given by the first of (2.18), is 
in fact in Al^Cfe-i) ^j-^y integration). But of course we do not know (yet) the initial data X'"^{0). 

To find expressions for X^ we start from the equations (2.13) with initial time at some instant T. And we 
use that IF{t) is a primitive of the function F{t), see (3.10), so that: 

X'^" {t) = X'^" {T) + IF'"' {t) - IF'"' (T) (4.3) 
where a — signi, and T has the same sign of t. 

The function X^'^(T) tends to become quasi periodic with exponential speed as T ^ aoo: in fact it becomes 
asymptotic to the j = component, see (4.1), at a:: = 0: X^f (0,aJT), (in the sense that the difference tends 



2{h-l) 



-,F^%x,i;), h = i,...,k 



j=o 

2h-l 



(4.1) 



Y,-^t{x,1p), h=l,...,k-l 

1=0 
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to 0, bounded by (5f|T|)^*' ^1^1). The function JF^"^ (T) also becomes asymptotically quasi periodic with 
exponential speed and average, because F^"' e M.q^''~^^ and by the definition of J: therefore the two quasi 



periodic functions of T must cancel modulo a constant equal to {Xq^{0, •)) = X'^^{aoo). 



Hence it follows that: 

X^- (t) = X'^^ (ctoo) + IF^^^ (t) (4.4) 

and, by inserting (4.4) into the second of(2.18), (considering also that the time average of TF^°' vanishes, 

and therefore tF^"^ (t) dr — tIF^"{t)+ a f-bounded function), we see that the X^"{t) can be bounded 
only if: 

X^S{aQo)=^Q, hence: X'^" (t) =IF^'' {t) (4.5) 

yielding, setting 1 = 0", the initial values of X'^ and a simple form for its time evolution. Analogously, 
recalling that X|^'^(0) = 0, essentially by definition, one finds: 

Xf'^(t) = J-i (l^Ff^it) - I^F^'^iO")) = J-'^T^F^''{t) (4.6) 

which gives a simple form to the time evolution of the a {i.e. |) component of X^ in terms of the operator 

defined by the r.h.s. of (4.6). 
Likewise considering the (2.17) and the behaviour at croo of W in (2.15) and recalling that X'''^{t) has to 
be bounded at aoo by (2.6), we see from the second of (2.17) that: 

paoo 

(0) = - / w;oo (r)Ff(r) dr (4.7) 
Jo 

Thus we get (defining at the same time also O and 0+): 



(t) = wii{t) / woo{T)FmT)dT - wwit) f WoiiT)Fl''{T)dT = O+F^'^ (t) 
J (cr) Jq 

X'mt) = WOlit) -f W00{T)F'^^{T)dT - wooit) f WoiiT)Fl%T)dT = OF^'^ {t) 
J (a) Jo 



(4.8) 



The (4. 5), (4. 6), (4. 8), and (2.5) imply (2.2) for ft, = fc + 1. As already remarked before (4.3) we note again 
that, since F|*5((too) = () for ft < fc, the X^~^^ ,X^^-^ functions are in fact in A4^'^'^~^\ (as the I operation, 

on such F^ functions does not increase the degree). Also, if one looks carefully at the AT^'^-evaluation in 
terms of one realizes that the 0, 0+ operations may increase the degree but by at most 1. Thus the 
inductive hypothesis made in connection with (4.1) is proved for X^'^^ , and it remains to check it for F^~^^ . 
The latter check follows from the expression of F^^"^ , see (2.14), in terms of the X^ with h < k: see (2.14). 
One treats separately the sums in (2.14) with |m| > 2 and |m| > 0. One just has to consider that in the 
first case, which might look dangerous for the inductive hypothesis, the products of X's contains at least 
two factors (which therefore have order labels smaller than k and verify the inductive hypothesis); and, 
furthermore, the coefficients idsf)m{'fOi'^t) or g^Josintpo or ^^Jqcosi^O) by (3.3), do not contain any terms 
that can possibly increase the degree. Hence F^'^^ G M."^^ . 

To see that F^^^^^" G A^o*^, i.e. F^'T^'^'^" = (), we simply remark that otherwise the second of (2.18) could 
not be bounded in t as t ^ oo; but we know that it is bounded by (2.5). 

Remark: The use of (2.5) is clearly a spurious element: it should not be necessary to invoke a rather 
involved analytic discussion {e.g. the "KAM" theorem yielding the (2.5)) to prove an algebraic fact, namely 
F^^^^"^ = 0, that if not valid would prove the claims of the theorem leading to (2.5) false. At least it is 

unpleasant to do so (although logically consistent) and a direct check of the algebraic property is highly 
desirable: it would show that (2.14), and the equations F'^S = 0, and (4. 4), (4. 6) and (4.8) yield recursively 
a formal power series expression for the whiskers. 

Such a check is possible, see [CG] appendix 14- This means that the property F^^ = and the (4.4), (4.6), 
and (4.8), coupled with the initial condition (2.9), always have a solution verifying (3.4). It gives us a formal 
power series solution to the problem of finding the whiskers equations (and those of their tori as well, which 
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are easily related to the t — > ±00 behaviour of the X'^ functions). 

The convergence of the scries, however, docs not follow from what said so far. In the case / = 2 it will be 
checked in §8, so that the I = 2 case can be made fully independent on the KAM-type results expressed by 
the (2.5) (and imply them, of course). If Z > 2 we still rely on the KAM results for the convergence, although 
it will be clear that with some extra work it should be possible to obtain convergence estimates along the 

same lines as in the I = 2 case. 

We can summarize the above analytic considerations as: 

F;=5(<70o) ^ /^,_, Ff-(V;,<700)(2^ ^ (^'^(•,C700)) = (4.9) 

for all fc > 1, and, still for all fc > 1, by: 

Xl{t) =wai{t)I{wooFl){t) - woo{t){T{waiFl){t)-I{woiFlW)) = 0{Fl){t) 

Xj'(f) =J-i {l'^{F!^){t) - j2(Ff )(0'^)) = J-^I{F!^{t) (4.10) 

xX{t) =wn{t)I{wooF'^){t) - wio{t){l{woiF!^m-I{woiF!^){0'')) = 0+{F'^){t) 

(t) =X(Ff )(t) 

where 0,0+,J^J are defined here (and in §3); and X^ = {X_ , X^, X+, Xi^) = {X!^), j = 0,...2l - 1, 
F'* = (0,0,^'^,^^'*) so that — , + are synonims of 0,/ respectively and J,,t denote collectively the labels 
j = 1, . . . ,1 — 1 and Z + 1, . . . , 2Z — 1 respectively (see also §3). Note that while X'* has non zero components 
over both the angle {j = 0, . . . ,1 — 1) components and over the action {j = I, . . . ,21 — 1) the F^ has only 
the action compnents non zero. Furthermore if at > the above functions describe a motion on the whisker 
W with initial data at some a and ip = it. 

B): Tori: 

A case of special interest is the case in which / in (1.1) is t/J-independent. In such case the pendulum 
and the rotators decouple and we are really studying the perturbation theory of a completely integrable 
{I — l)-dimensional system, of rather special form, namely that of model 1) in (1.1). 

The whiskers will, in this case, be degenerate and, at <^ = tt, have the form: 

a ^ {X^i6;d),Io{TT)) (4.11) 
and we can deduce the geometric locus of the torus, by letting ip vary in T'~^, via: 

A = Xt(^, aoo; 0) = Er=i /^'^"(V^, «^oo; 0), a = + Er=i M'=^f(V^, «^oo; 0) (4.12) 
and, in fact, in this case X'^ {ip,t:0) is t-independent, and also a independent. 

If < Jo < J < +00 we can use the general KAM theory to say that the above invariant torus does exist 
for < small enough. The analysis in A) of the whiskered tori then yields, in this particular case, that 
the motion t — » X{t) = {Xi{t), X-^{t)), given by the following (4. 13), (4. 14), is a quasi periodic motion on the 
invariant torus: 

X|'(i) = J-'{l^F^'{t)-J^F^^{0)), X'^it) = IF'^it) (4.13) 

where we have dropped the a labels as the functions F,X no longer depend on them. And, by (2.14), if 
m = (mi, . . . ,mi_i), k] > 1: 

/ — 1 mi 

|m|>0 (fei.)rti,fc-i»=lJ=l 

because the pendnlum and the whiskers really disappear as a consequence of the decoupling. 
The analysis in A) of the whiskered tori can be repeated to show that X^,F^, for i = 1, ...,/ — 1 (J, 
components) and for i — / + 1,...,2/ — 1 (| components), are in M'^ for all h > 1. This is a stronger 
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statement than (3.3), vaUd only for the X'^ components and in the present special case: note that in our 
notations , with the label as superscript, means that not only the functions in M.'^ are bounded (i.e. in 
M), but also that they have f-degree 0: see definition 1, §3. 
Furthermore: 

F'^S = (4.15) 

As in A) the property that F!^^ = is derived from the KAM theorem, (as done in (4.9)). But also in 

this case one can check it directly. In this way we free ourselves from the KAM theorem and we arc in a 
position to study it independently. This means that the property F^S = and the (4. 13), (4. 14), coupled 

with the initial condition F^{t) = dsf{ojt), (see (2.9)), always have a solution with quasi periodic (with 
spectrum uj). Such solution gives us a formal power series solution to the problem of finding the invariant 
tori equations. In this case we shall discuss and prove the convergence of such scries. The first proof ("non 
KAM") of (4.15) is due to [CZ]: in §6 we shall find a more direct proof of (4.15), see remark 3) after (6.18): 
which is very simple although far less general than [CZ]. But for clarity of exposition we prefer to appeal to 
[CZ] and continue, without stopping to prove (4.15). 

The above statements do dot require that J be a constant: it can be, for instance, a diagonal matrix (of 
dimension I — 1), provided we interpret conveniently the multiplications by J~^. It is then interesting to 
note that all the above statements remain true if some, or all, the elements of the diagonal matrix J become 
infinite. This is not difficult to check by going through the classical proof of KAM theorem, (it is, in fact, not 
easy to find it explicitly stated except in the case in which either only one of the Jj is infinite ( "periodically 
forced systems") or all of them are infinite). However one finds a result with a analyticity radius (in fj,) 
which, as mentioned in the introduction, is not bounded away from uniformly in the size of the largest 
among the non infinite Ji 's. Although this is nevertheless sufficient to prove that F!^^ = 0, it IS reassurmg 
that the algebraic check, that we refer to above, is independently possible. 

// we let J be a diagonal matrix and we allow for some, or all, its elements to be +oo and if we set rj = 1 in 
(1.2), and ask whether the torus ^ = can be analytically continued for ji complex, j/x] < 6~^, with b being 
J -independent, then we say that we are considering a "twistless KAM" problem. 

The convergence of the formal series for the tori equations studied in §7 will yield a radius of convergence, 
b~^, independent on the Jj (a result stronger than the usual KAM theorem relying on the twist property). 
Hence we show, by direct bounds, that the just posed twistless KAM problem has a solution (a fact that 
could be checked also by a careful exam of some of the classical proof of KAM theore, as mentioned in the 
introduction). 

This is less surprising if one studies the J = +oo case, see [Gl]: problems 1,16,17 §5.10, showing that the 
system is completely integrable. In the language of the present work the quoted reference [Gl] can be easily 
worked out: one finds that X^ = 6ifh>l and F^^ = 6,X!^ = 6ifh> 2, while X^{t) = I{dsf){t), so that: 

A = A{a) = - fi^ — ^ . a = dGT'-i (4.16) 

are the equations of the invariant, twistless, torus arising from the unperturbed ^ = torus. 
C) Parity properties. (I). 

We close this section by pointing out a few parity properties of the operator I, which will be useful below 
(see [CG], §10). 

Suppose that M depends on other I — 1 dimensional angles a as a linear combination (with a-independent 
coefficients) of monomials: 

^-^x'^a^^ cos.K'^ ■ut + a-fl)= ''I^x^a^^ Y pX ^^Pi^-^t+fi-a) (417) 
p\ p\ 2ix ^ ' 

with X, x' = 0, 1 and cos^' y = cos j/ if x' = and cos^' y — sin j/ if x' = 1, then IM has the same form. 
We shall say that M is time-angle even if X + x' = even for all monomials of M. If, instead, x + x' = odd 
for all monomials we say that M is time-angle odd. It then follows that the time angle parities of M and 
IM are opposite (when either is well defined). 
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After the above remarks we make the inductive assumption that F^'^{t;d) has action components (+, t)) 

denoted symbolically d, of odd time-angle parity in the above sense and angle components (— denoted 
p, of even time angle parity. Opposite parity assumptions will be made for X''°'(t;a). We shall write: 



ph. 



\ 



(4.18) 



dropping the label a from F and X. In fact the main goal of the above formalism is to treat simultaneously 
the stable and the unstable whiskers: for t > it is cr = 1 and F^, represent F'^'^, while for t <0, 
a = —1 and F^,X^ represent F^~,X^~. Hence we can symbolically write: 



F''=J2Sx'' (gatfa^ cos^> (u ■ ut + a ■ ft) 
X^ = ^£,x'' [gat)^' cos^' ■ vt + a ■ fl) 



(4.19) 



with suitable cr-indopendcnt coefficients (5,^ and X + x' = even for the (+,t) components and odd for the 
(— ,i) components in the case of X, and with reversed parities in the case of F (the symbol cos^y being 
defined after (4.17)). 

By remarking that X{t) can be expressed via the wronskian: 



X{t) = W{t)[x{Q) + j W{T)-^F{T)dT 
where the wronskian matrices W{t) and W{t)~^ are respectively: 



(4.20) 



/wooit) woi{t) \ 

1 J-H 

wio{t) wii{t) 

V 1 



J 



( wnit) 

1 

-wio{t) 

V 



-woiit) 


Woo{t) 







-J-H 


1 



(4.21) 



{wjq, j, q = 0,1, being the matrix in (2.15), with Wjj even in t and wio,woi odd in t), one deduces immediately 
from the above property of X that X^ will have the opposite structure to F^ {i.e. if F^ = then 



X^ = 



)• 



The above remark and (2.11) imply that if X^' has the structure ( \ iov h' < h then F^ has 



structure. And since it is obvious that F^ has 



structure, the (4.18) follows by induction. 



An important consequence of the parity of X^, X^ is that if a = they arc even functions of t so that, see 



(4.17), if X = it is x' = 1 and viceversa. Hence X-^ (0, 0) — X-^ (0, 0) and we see that a = 0, = tt is a 
homoclinic point (and more precisely aj = 0,7r, and <^ = tt ar 2'"^ homoclinic points. In the following we 
study the homoclinic point a = 0,ip = it. 

D) Parity properties. (II). 

It is important to study the following operators: 



p=± P°° 

Vf{t) =^ 1] f {wQl{t)woo{T) - Wol{T)woo{t))f{T)dT 
^ „ u ^ poo 



(4.22) 



Then one checks that U, V preserve the time angle parity of f as well as the holomorphy. This is not the 
case of the operation I: the latter has floating integration axtremes (depending on the siign of t): therefore 
it maps analytic functions of t into functions with a possible non analyticity at Ref = 0, unless the function 
is odd in t. 
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Hence if / is analytic in t in a strip Imt < ^ and time angle even it will be: 

U f{0") = a u]; cos ad - u + Uj; sina- u (4.23) 

for suitable coefficents u^,u; and similarly for V. But (4.23) has to be analytic; hence: 

C//(0) = ^Ui7sina- i;, V f{0) = Y^Vc!smd ■ u (4.24) 

for suitable coefficients up, vp. If / is analytic and odd we exchange the role of sines and cosines. 
Note, as implicit in the discussion of C) and D) above, that if / time angle even and analytic then Tf{Q"') 
has the form X]i7(/isina • i7 + /|(t cos cf • z7) , i.e. the generally present (in the non analytic cases) part 
f'ia cos d ■ V is not necessarily 0; and if / is time-angle odd and analytic then 1/(0*^) = X^^(/i o sin a • 
V ^ pvcosd): i.e. "no simplification" occurs because of analyticity. 

We see that the parity properties, although very simple, can become quite intricate to visualize. 



5 Trees, roots, nodes, breinches cind fruits: the formalism. 



We develop a graphical formalism to represent, via (4.10) and (2.14), the generic /i-th order contribution 
to various quantities related to the invariant tori and their whiskers. 

We shall consider, for instance, the whiskers splitting in model 2), (1.1), at the point with coordinates (/^ = 
and d, which is A*^(c5) = X*^^(0;q!) — Xj^~(0;a) if j is an action component subscript (hence a homoclinic 

point (at = 0) corresponds to the values A such that A^(cJ) = 0). We shall also consider, as a second 
example, the homoclinic scattering phase shifts (a notion introduced in [CG]), and as a third example the 
parametric equations of the invariant tori in model 1). 

The case of model 1) can be regarded as a special case of model 2), with / independent on the pendulum 
position if, hence the two problems can be treated with the same formalism: although the latter is simpler 
and, if independently formulated, would require a slightly easier analysis. 

Recall our label convention, in (2.7), for the action angle variables: we label with j = the angle, 
with j = 1, — 1 the angles ai, . . . , a/_i and we label with j = I the pendulum action I and with 
7 = Z + 1, . . . , 2Z — 1 the rotators actions A. The are given by (2.14) and the are related to the F'' by 
(4.10). 

Note that the action angle labels are enumerated, between and 21 — 1, in the order in which they appear 
in (4.10): because in (2.14) a subscript — or + is synonimous of a subscript or I; and | or "f denote, 
collectively, the subscripts j = 1, . . . ,1 — 1 or j > I. Furthermore F'' depends only on Xj, h < k, with 
j = 0, . . . ,1 — 1, {i.e. it depends only on the the lower order — , J. components of X'^; this is a consequence of 
the / in (1.1) depending only on the angles cp, d). 

We imagine to use (2.14) recursively to express everything in terms of F^ only. The structure of (2.14), 
(or, more generally, (2.11); see also [Gl] Ch. 5, §11), leads us naturally to a graphical representation, quite 
familiar in perturbation theories, see [G2] . This fact was noted explicitly in the context of KAM theory by 
[E], (see also [V]). 

We can see a tree grow out, of (2.14), (4.10) as follows. 
First we represent F^{t) as a fat point v: 

F^{t)= (5.1) 

Then we consider the improper integration operations with upper limit t, denoted O, J^^T , 0+, I in (4.10). 
We represent them with a line segment of unit length, called branch, joining two points r, v, and calling r 
the branch root, and v the branch node. The branch will bear a branch label j = when representing O, or 
a label j = 1, . . . , Z - 1 for J-^J^, ov j = I for 0+, ov j = I + 1, . . . ,21 - 1 iov I, see (4.10). A label t will be 
attached to the root r: 

I • 'V (5.2) 
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Then we can represent the whole formula (4.10) as: 

X^{t) = * . ^ (5.3) 

and there is an impHcit constraint as j, J cannot be independent, as one sees from (4.10): in fact j = => J = / 
(first or third Hne in (4.10)); and j = \,...,l— l^j = j + l (second line in (4.10)); and j > I => J = j 
(fourth line in (4.10). 

Going back to the representation of Fj'{T) as a fat point, (5.1), we can use (2.14) to express it in terms of 
Xf^ir) with E!=o Er=i K = hovh-l. 

Using the representation (5.1) for Xs*'(t) we see that (2.14) can be written, {j > I): 

T,h°,l 



(5.4) 



where the first mo branches are labeled 0, the next nii are labeled 1, etc.; 5 = 0,1 and /i^ = /i — 5, because 

of the meaning of the symbols {kp),Ti^k, (fcp)m.fe-i in (2.14). 

The vertex v, that we call a node, corresponds to the factors ml {g^Jo sm.ip)^{ip'^) and ml {df)^ appearing 
in (2.14) and deprived of the combinatorial factor ml = Ilto™''' (2-14), (2.12). And the label v is 
introduced to split such factors as sums of their Fourier components. Namely, see (1.1), let: 



f{a,ip)= /.*e^("^+^-), ^ = 0,1 

i/=(n,i7) 

f{^, d) ^ Jog' cosip=Y Y e'"^ fi^^ ^ E f ^^'^^ 



p=0 

=±1 



then the factor represented by the node v, with its labels r, v, 6,j is: 



^ • {Ylii'^sr^) fy^^^°^^)+-<^+'^^)) (5.6) 



.s=l 



where j — / is the branch label of the branch leading to v, and the introduction of the above Fourier 
representation is convenient as it eliminates the derivatives with respect to ^p, a. in the coefficients of (2.14). 

We call T a time label, v a mode label, 8 an order label and J an action label associated with the node v. 

One can further simplify the (5.4) by imagining that the m = X^^-qTOj branches coming out of v are 
distinguished by a number counting them from 1 to m and appended to them, which however is not explicitly 
written and is thought as associated with the branch. Then we drop the condition that the branch labels 
j = 0, . . . , Z — 1 are in non decreasing order and rewrite (5.4) as: 

^ ml 



T,hm,3n 
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so that we can put freely the labels without worrying about the order (if we did not replace the combinatorial 
factor in (5.4) with that of (5.7) we would count the same term mj!"-^ times and this explains the 

change in the combinatorial factor). 

It is now easy to write (4.10) with the F'^ in it expressed in terms of lower order , h' < h, via (2.14): 
thinking as written in (5.3) and (2.14) as written in (5.7), we get: 



where the sum is over all the possible choices of the labels, subject to the conditions described in their 
definitions {e.g. j^g =j). 

We can now iterate the representation (5.8) by replacing the fat points by nodes ont of which new branches 
emerge, until we reach fat points with order labels h= 1, where the expansion ends and we call the last fat 
points top nodes, drawing them as slim as the other nodes. The root r of the first branch drawn will not be 
regarded as a node, but it will be called root. 

We see that the above expansion leads to a sum over trees bearing " decorations" : such trees have branches 
emerging from the same node which are regarded as distinguished by a number, never marked in the drawings, 
see fig. 1 below. On such trees a natural group of transformations acts: it is generated by the following 
operations; fix a node and permute the subtrees emerging from it. Two trees that can be transformed into 
each other by a transformation of the group just defined will be regarded as identical. 

^vi jvi 

V4 V8 



Vn 

fig. 1: A tree i? with m^g = 2,m^i = 2,mv^ = 3,m^3 = 2, m^^ = 2 and m = 12, n{'&)=n{^i)n{'&2) 
= (1 • 1^)(2!2!) = 4, H'^i'! = 2^-6, and some decorations. 

We call the above trees semitopological trees: we call topological trees the trees in which the branches 
emerging from the same node are not numbered, and also such trees will be identified when superposable 
modulo the action of the same group of transformations. 

A third type of trees, that we call "numbered trees" or simply trees, are obtained by imagining to have 
a deposit of m branches numbered from 1 to m and depositing them on the branches of a topological tree 
with m branches: numbered trees that can be superposed by transforming them with the above group 
of tranfsormations will still be regarded as identical. The number of such trees asociated with a given 
semitopological tree is ml Y[y rnyl~^ and using such trees, which will be the only ones considered from now 



(5.8) 



0" 



Jx 



root 



Vo 
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on, unless otherwise stated, the (5.8) becomes: 



^i^w= i: E N-ij (5-9) 

j^etrees labels; V S^=h 

where m(i?) = number of branches of and the drawing of Fig. 1 symbohzes, now, a well defined hierarchical 
chain of operations of improper integrations.^ 

The expression (5.9) is very convenient, as we shall see, from a combinatorial point of view. It is however 

clear tha the summation in (5.9) over the labels produces a number w({f) depending only on the topological 
tree associated with t?, i.e. it does not depend on the branch identification numbers (hidden in the pictures) 
of the branches. Therefore in real calculations the above sum over trees can be replaced by a single sum 
over semitopological trees, or by an even simpler sum over topological trees. ^ 

Let us fix some terminology and conventions: given a tree and a family of labels for it we denote O their 
pair and call it a labeled tree. 

the trees are dawn from left to right and are regarded as partially ordered sets of nodes in the obvious sense 
(see footnote ^ above), with the higher nodes to the right {i.e. we draw them as "fallen trees" , unfortunately: 
the vertical notation would have required too much space). The branches are naturally ordered as well; all 
of them have two nodes at their extremes (possibly one of them is a top node) except the lowest or first 
branch which has only one node, the first node wq of the tree. The other extreme r of the first branch, 
(which is the root of the branch), will be called the root of the tree and it vjill not be regarded as a node. 
If vx and are two nodes we say that v\ < V2 if W2 follows Vi in the order established by the tree: i.e. if 
on has to pass vi before reaching V2, while climbing the tree. 

each node carries a time label t„, a mode label Uy, and order label Sy and an action label jv = I, ... ,21 — 1. 
Each branch carries and angle label, j\ = 0, . . . ,1 — 1; but if A is the root branch its label can also be an 
action label j\ > I. If A leads to v then j\ = Jt, + / if Ja = 0, . . . , ^ — 1, while J„ = jy if jx = I, . . . ,21 — 1 
(which is allowed only if A is the root branch). 

the order h{Q) of the labeled tree O is ft. = Sy, i.e. the sum of the order labels of the nodes. The number 
of branches emerging from the node ii is m = 1 + m„ (as one has to count also the root branch). Of 
course, as the order label Sy = 0, 1 and as each node v with Sy = must have m„ > 2, (see above and 

(2.14)). it is h < m < 2h. 



More formally a topological tree is a partially ordered set with the property that any two elements follow some common 
predecessor (hence there is a minimum element, or root) and have no common follower if they are not comparable. This is 
visualized (sec fig. 1 above) by representing its elements by oriented unit segments. We draw the first segment (in the partial 
order) and attach to its endpoint the myg > segments representing its immediate followers, and so on. The ciidijoiiits of the 
segments become nodes of the tree: to each node v is associated a branching number mv > as the segments will be called 
branches. The first branch will be called the root branch and its first point will not be a node, but it will, nevertheless, be 
called the roof. The endpoint Do of the root branch will be called the first node. Two topological trees giving rise to the same 
partially ordered set will be considered identical. Each node v with my > can be regarded as the root of a subtree: and the 
operation of graphical permutation of two subtrees emerging from the same node establishes an equivalence relation between 
trees which is a notion coinciding with that of giving rise to the same partially ordered set. 

It is cleaj- that the set of topological trees with m branches contains at most as many elements as the random walks on the 

lattice of the integers > 1 with 2m — 2 steps: hence the number of topological trees with m branches is bounded by 2^"^"^. 

^ For this purpose the following identity is used. A natural combinatorial factor n(i?) that can be associated with a topological 
tree ^ that bifurcates at the first node vo into nivQ subtrees among which there are only q topologically different subtrees 
. . . , i9q each of which is repeated pi, . . . ,pq times, is: 

M^) =111=1 Pi'- ^(^i)"' (5-10) 

Such combinatorial factors are useful when one has to consider sums over trees ■& with m branches (root branch included) of 
functions F{'&) whose values depend only on the topological tree. In such cases one finds: 

T^. S,Jg trees ^^'^'^ = '^■ffe^topological^i^ve^ m„!)^W = '^^etopological n(ij) ^('^) i^-^^) 

The above identity is closely related to Cayley's formulae, see also [G2], (6.1), (6.2) and (5.13)), and [FG]. It simply reflects 
that the number N(m; {mv}) of trees with m branches and branching numbers m^, hence generating the same semitopological 
tree, is ml m^l^^ (which is (an adaptation of) Cayley's formula, (see [H], §1.7)), while the number of trees generating the 
same topological tree is m! n(i9)~^. 
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the momentum of a node v or of the branch leading to v is z7(v) = J2w>v ^w-, if = {nv, I'v) is the mode 

label of V. The total momentum, is i7(fo) = Y^^yy^ 

Given all the above decorations on a labeled tree Q we define its value Vj{t; O) via the following operations: 

(1) We first lay down a set of parentheses () ordered hierarchically and reproducing the tree structure (in 
fact any ordered (topological) tree can be represented as a set of matching parentheses representing the tree 
nodes). Matching parentheses corresponding to a node v will be made easy to see by appending to them a 
label V. The root will not be represented by a (unnecessary) parenthesis. 

(2) Inside the parenthesis („ and next to it we write, setting jy = jy — I, the node function, see (5.6): 

f -5«('^«)j„/':e'(""'^(--)+(«+--'')-'^) ni=o(*^-)"" (5-12) 

here jy is the branch label of the branch leading to v. The last product is missing if no nodes follow v. (3) 
Furthermore out of („ and next to it we write a symbol £y which we interpret differently, depending on the 
label / = on A„ and on the action label /' = jy on v: 

^/ N f^f •) (^^'), ifv>vo , jA„ =0; 

\v Jv 1 j-iT ( . 1 C^..A iiv>vo , l<j\,<l-l; 



for V > Vo, otherwise: 



0{ ■ (tn , iiv = vo, JA„ = , 

\v / V 

J-^t{ ■ ) [t^) iiv = Vo , l<jx^<l-l 



(5.14) 

0+[ ■ ] (O , if^; = ^;o , 3x^=1; 



j( • ) (i<^) , ifv = vo, l + l<jx,<2l-l 



t being the root time label of the tree and the superscript a attached to t is important only if f = 0: in 
such case (5.14) has to be interpreted as the limit as t — > 0'^. 

Remarks: 

1) One realizes that the giant symbol thus constructed has a perfectly defined meaning as a hierarchically 
ordered chain of operations 1: it gives a "single contribution" to the value X'^jy{t; a) = Xj^{t)e^"''^ . For 
instance: given a = ± multiply a times all the above values of the trees 6 with order /i(6) = h, momentum 
j7, and j > l.t = O*^. Such sum will give the h-th order contribution = Xj'(0+;a) — Xj{0~;d) to the 
homoclinic splitting. 

2) : If we do not perform the operation S'^ relative to the time Ty^ of the first node vo and set it to be equal 
to t, setting also j = jy^, we see that the result is a representation of Fj^ {t), if j^^ is the label of the node 

Vq. 

3) : Note that if a = then we are at a homoclinic point, because the hamiltonian (1.1) is even: so that 
the sum in remark 1) above yields a value for all components j = I, . . .21 — 1, see the final comments in 
§4, C), and all fixed tree shapes. 

4) : Suppose that one whishes to study model 1) in (1.1). For instance suppose one wants to determine 

the parametric equations for the invariant torus run quasi periodically with angular velocity uj, analytically 
continuing the unperturbed torus A = 0. The (4.10) must be replaced by (4.13) and (2.14) by (4.14), while 
(4.9) remains the same. But very little has to be changed in the above graphical representations. 
In fact it is sufficient to consider, in the sum Xj'-(t) = X^q. ^(q-j^/j V^i7(i; 0)6*"^ ", only trees 8 with no 
node or branch labels j equal to 0,1 {i.e. all the labels have to be rotators labels); and dy = 1, so that 
m(©) = h{Q) = h. This is obvious because we can think that the pendulum is present but that it is 
completely decoupled from the rotators. 

The functions X^{t),X!^{t) thus defined are, of course, the solutions of the equations of motion of a point 
starting at t = on the above invariant torus T at a angular position a and at action position A^{0) = 
Yl'h=i l-''^X!^{0), for /i small enough (this is a convergent scries for /i small, at fixed Jj, by KAM, as already 
mentioned): if. in view of the aim of proving the theorem by directly showing the convergence of the series, 
one does not want to use the KAM theorem then the latter series will only be a formal power series solution 
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to the problem of finding a motion on the invariant torus T. 
Therefore setting: 

oo oo 

d{t) = a + ut + J2l^^^lit)^ At) = ^l^^^-^{t) (5-15) 

h=l h=l 

and letting t vary in [0, +00), or (—00, 0] (or, for that matter, in any infinite connected subinterval of the 
above two) wc describe (at least formally) a dense set on the torus; i.e. the torus itself. 
The formalism necessary to see the cancellations, and to make use of them, is completed. 



§6 Cancellation mechanisms. 



A): Homoclinic cancellations. 
1) (representation of£"^): 

To study cancellation mechanisms for the whiskers splitting we consider trees with root time t — 0, {i.e. t = 
0^), and we introduce a useful representation of the £ operations. Given a tree -d contributing to order h and 
a total momentum j/, let w be a node. We fix our attention on one such node v and call v' its predecessor: 
the case in which u is a top node and the predecessor v' of v is the root is the simple first order case, so that 
this case will not be considered as it can be studied easily by direct evaluation and no cancellations occurr 
in it, in general, (hence v' will not be the root and v > ijq)- 

Supposing that v' is not the root, we realize that the "integral" describing the I operation (associated with 
the vertex v) can be written, if j = is a J, component (i.e. if < j < Z — 1, see (4.6)): 

('^■"' ~ Ty)Sy^^{Ty)dTy " ^^^^ ( "Ty ) 5^ ,^ (t„ ) rfr^^ (6.1) 

with Sy^^{Ty) being the result of the operations performed to evaluate the integrals and sums inside the 
parentheses {y)^. Note that the Sy^{){T) depends only on the subtree t?^ C i? rooted at v' and consisting of 
the nodes following v va. (and bearing all the decorations of 
Calling t = Tyi, T = Ty, using (3. 13), (3. 14), we can replace the above expression by: 

J-i (x(<7 < 0)tftZSvAr)dT +f\^it - T)SyAr) dr -fl^i-T)SyAr) dr) (6.2) 
Note that a rearrangement has been necessary to simplify the term with the characteristic function as well 
as the other; here we define to be identical to^'^_^ — (3.14). 

A more symmetric representation is obtained by noting that a "mirror" formula must hold with — oo playing 
the role of +oo. Averaging over the two formulae for the same quantity (6.1) we get: 
-j^ f-+oo 1 r 

+ 



^ ^ J -00 ^ +00 J -00 

I f° \ 
+ o(r rSyA^)dT+f- r5,,^(r)dT) (6.3) 
^ J +00 J -00 ' 



+00 

such symmetrization is not really needed, but it has some aestethic value 
A similar representation can be achieved also for the case j = = 0: 

"+00 



X{cr < O)woi{t) Woo{T)Sy^^{T)dT + (woi{t) w;oo(T)S'„,^(r)dr- 

J —00 J +00 

{t)f- Wol{T)Sy^^{T)dT] +Woo{t)j- Wol{T)Sy^^{T)d7 

J +00 ^ J +00 



Woo{t)-l- Wol{T)Sy^^{T)dTj +Woo{t)j- Wol[T)!yy^^[T)dT 

+00 •/ +00 

see (2.17) and symmetrization over the choice of ±00 yields the more symmetric formula: 



(6.4) 



1 f+°° 

-awoi{t)f- woo{T)SyAT')dT+ 
^ J -00 



+ 2 



^ir (woi{t)woo{T) - woo(t)woi{T)\Sy^^{T)dT + f- (same)dT+ (6.5) 
^ +00 ^ ^ J -00 ^ 

\woo{t)(^j- Wol{T)SyAT)dT + j- Wol{T)SyAT)dT^ 
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We describe the above representation of the contribution of the considered tree branch by affixing a "bubble" 

around the node v and enclosing all the branches following it: the bubble will have a wiggly boundary or a 
smooth boundary to distinguish between the first and the third terms in the above sums. If a node is not 
enclosed in a bubble that cuts the branch linking it to the previous node then this means that in the above 
sum we selected instead the intermediate term: in this case we mark the node v by a label TZ (in analogy 
with renormalization theory, see [G2]; here TZ has nothing to do with the parameter R appearing in the 
definition of T) . 

We can repeat the same representation operation on the (improper) integrations pertaining to all the other 
nodes, starting from the highest on the tree and going down towards the root: we stop at the first nodes 
following Vq (the latter node is somewhat different from the others as the corresponding £^ integration has 
a un upper limit t fixed). 

Thus the most general contribution to Xj (t) by trees with order h will be represented by a tree with h nodes 
with index S = 1 (and up to 2h — 1 branches, see c) preceding (5.12)) and all the labels introduced so far 
(TZ labels included) plus an arbitrary number of bubbles drawn around tree nodes above va, and drawn as 
described (avoiding overlappings). Of course we could replace the bubbles with labels affixed on the nodes 
following the root: but we prefer the "bubbles notation" as it reminds us of the analogous bubbles that can 
be used to describe the renormalization cancellations in quantum field theory, (see [G2]). 

We call free the branches that are not enclosed inside bubbles of any type. The total free momentum of the 
tree will be the sum of the node modes associated with the free nodes: note that it is quite different from 
the total tree momentum previously defined (see (d)) preceding (5.12). 

Looking at (6. 3), (6. 5) we see that the first and the last terms contain constant factors which can be taken 
out of the integration operations associated with the other tree nodes. Thus the bubbles involve simpler 
integrals. Furthermore the three terms in the (6. 3), (6. 5) individually preserve the time-angle parities, as all 
the operations in §4 did: this is essential as it will permit us to use the parity arguments derived at the end 
of §4 (see C) and D) in §4). 

2): (Resummations): 

Consider a bubble containing a subtree of order h linked to its root node v' > uo by a branch v'v carrying a 
momentum v = '}2w>v '^i'^) and a label j (0 < j < Z — 1). Fixed i7, j, h we can sum over the possible choices 
(consistent with the labels h) of the subtree and its decorations: 

.h,i.,.a _ / W)-' E !°oo^^, (-) dgr , J = 1, i-i 



gh,2 iv.a ^ \ -iJa')-' E hf iZ'^arFijir) dgr , j = 1, I - 1 

\-M-'Ehf-Z^9Woi{r)F^bAr)dgT , j = 



where wqi = (Jog) ^wqi = jw, see (2.15), is a convenient adimensional matrix element, and F!ij{T) is the 

function resulting from the resummation. The terms /^^^j^ correspond to wiggly bubbles, while the 0p'f^ 
correspond to smooth bubbles. The "value" of the subtrees on which we are summing is given according to 
the rules described in 1)-=- 4) above. The have the interpretation, which we leave to the reader to verify, 
of being proportional to the j-th component of the Fourier transform of mode P of the homoclinic splitting 
A'*(a) (hint: just note that —'f°^^ = 'f^^^ ~^-oo ^^'^ recall the expression of X'' + (0+) and X'^~{0~) in 
terms of trees and the remark 2) following (5.14) which implies that Fbj in (6.6) is really the same function 
F'^ considered in the previous sections, see (2.14)). 
Taking care also of the constants of proportionality it is: 

A^-,- Jo5/3^o ' ^h+j = J9Pi:j, {1<J<1-1) (6.7) 
(note that the bubbles values are dimensionless). 

The /3ff(^ are new objects, while the quantities a^j that one would obtain by eliminating the sign function 
a from the integral^ in the definition of (3^''^ would be the j-th component of the scattering phase shift (see 
[CG] for a definition of the notion of phase shift): however such interpretation will not play a role in our 
discussion; hence we do not pursue the enterprise of checking the latter statement. 

The parity properties, discussed at the end of §4, allow us to conclude immediately that: 

^ - At^j, (i'lf ^ - (3% a% ^ atv, (6.8) 
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Hence: A'i = 0, = 0. 

3): (Definition of resummation trees and of dry and ripe fruits): 

The above remark suggests introducing a new type of trees, that we shaU caU "resummation trees", or 
7^-trees. A resummation tree, with all its decorations (TZ and bubbles included), is defined by drawing a tree 
with all its decorations and by deleting the contents of the bubbles, leaving only the branches connecting the 
top free nodes to the nodes v inside the outermost bubbles (if any). We shall leave the bubble around each 
V and on the branch leading to v we write the label and the total momentum i/x^ of the deleted subtree 
together with the total order of the deleted subtree. 

By construction the resummation trees have bubbles which can contain only one node v. It is natural to call 
such resummation bubbles "fruits " ( "of the resummations" or "of the trees" , as preferred) . If the bubble is 
wiggly we call it a " dry fruit" while if it is smooth we call it a " ripe fruit" ; the node of the fruit will be called 
the seed of the fruit. As we shall see the dry fruits values will be estimated, in our inductive construction of 
bounds on Ai. c^asily in terms of the inductive hypothesis, while the ripe fruits will, at each inductive step, 
be just "ripe to be bounded". 

It is convenient to define the "value" of a resummation tree by the same prescription we used for the 
previously introduced trees, but changing the symbol corresponding to the seed v (preceded by v') from 
(5.12) to (see (6.3), (6.5), (6.6)): 

^](^')#e-- with .1W = {:J^(,), IJjlo-'^"' (6-9) 
if the fruit surrounding the node is dry (the dimensionless matrix element wqi is defined after (6.6)); or: 

-l(^')/3fev- with .|w = {i;„„(,), iEjio""'"' (6-10) 

if the fruit is ripe. 

The name seed is fairly appropriate as it really represnts a sum of appropiatcly evaluated tree values of 
trees having the seed has a root. "A seed can be magnified to reveal its content in trees" whose values add 
up to the value, (6. 9), (6. 10) of the fruit containing the seed (as it should be). 

Finally we renumber such trees (as described at the beginning of the section) according to their topological 
structure (i.e. resummation trees do not inherit the numbering from the trees that generated them). 

This is a good definition as we can now say that the sum of the values of the decorated resummation trees 
of given order is equal to the sum of the values of the previously considered trees of the same order. The 
non trivial part of such claim is dealing with the combinatorial factors; the correctness of the combinatorics 
is easily verified if the trees are regarded (as we are doing) as having all the branches distinct. 

A simple cancellation can already be seen: consider the contribution to Fi - {t) coming from trees with (J 
total free momentum and even fruits number, then given a tree we can consider another "conjugated" tree 
with the node momenta of the free nodes simultaneously changed in sign; summing their two contributions 
we see that the integrals to be performed arc integrals of a time odd analytic function over the whole line: 
such integral will vanish (by the parity considerations of §4). Hence we have the mechanism: 
I) Cancellation of the contribution to (3^'} from the 72.-trees with an even number of fruits, summing over the 

choices of a global sign multiplying the free node modes. Hence trees without fruits and with free momen- 
tum do not contribute to the value of ff'j'i , i.e. to the Fourier transform (in the a angles) of the splitting 

(note that v is the total momentum, hence in general it is u ^ Q). We call this a "parity cancellation". 
A deeper cancellation mechanism is: 
II) Cancellation of the contribution to the splitting from 7^-trees carrying at least one fruit with label 
j > 0, summing over all the possible ways of attaching the fruits to the tree. We call the latter the "KAM 
cancellations" as they extend the cancellations discussed below, leading to I), II) at the end of §6, and are 
sufficient to carry to KAM tori the Sicgel Brjiuno method (§7). The (easy) proof is exactly the same. And 
it is not a parity cancellation: it will be called a " KAM cancellation" . 

We see that there are still quite a few cases of trees with free momentum that can contribute to the 

homoclinic splitting. A few more cancellations can be spotted.® 

We shall see that we do not have to worry about trees containing at least one dry fruit; but the basic 
problem remains for the trees with only ripe fruits. The problem is solved, for the trees with a root label 

^ for instance: 
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j > Ihy the following really remarkable mechanism, which is a non trivial extension of the above mechanism 
II). 

IV) Cancellation of the 7?,-tree values contributions to the splitting from 0-free momentum trees with root 
carrying a label j > l,hy summing over all the possible ways of shifting the root branch location to other 

free nodes of a given rootless tree. 

This is another cancellation which is not a parity cancellation: it implies as a particular case the KAM 
cancellation above, but the latter is very elementary (see below), while the more general cancellation IV) is 
much deeper, and it is essential for the homoclinic splitting theory (for which the KAM cancellation is not 
sufficient). With some thought one realizes that without cancellation II) for the KAM theory, and IV) for 
the homoclinic splitting the above "nice" formalism would be essentially useless. The proof of IV) is based 
on the "tree root identity", due to L. Chierchia. 

4): (Tree root identity), (Chierchia): 

Consider a resummation tree without fruits and consider its i?-value. Suppose that the total mode of 
is z7(yo) = 0, and let j be the root label, h the order of i9. Fix v\,...,Vs and ji, . . . , j^, where s > and 
ji G (0, . . . , ? — 1). Consider all trees obtained by adding to ?9, in all possible ways, s branches with a ripe 
fruit carrying the labels /ij, Vi^ji with « = 1, . . . , s, or no fruit at all (s = 0). Suppose that the root label is 
3>l- 

We want to prove that such trees (with vanishing free mode) contribute to the splitting A,^^, by a 
cancellation mechanism based on the symmetry property of the following quadratic form, setting Fii{t) = 

QG{t) = Y^ I w{t,T)G{T)dT (6.11) 

p=± 

/ + 00 i-t 
g-fli.*ts^i J2 / w{t,T)F{t)e-''^-^-^G{T) dT = {QFr„Gr,) 
-oo p— ± 

where w{t, r) is either t — r or woQ{t)woi{T) — woo{T)woi{t), see (6.3), (6.5), and F, G are arbitrary elements 
of A^. The (6.11) is immediately checked, for any i?i,i?2 large enough, from the definitions and uses only 
the antisymmetry of w{t, r). 

Integrating (6.11) over R\,R2 on the appropriately small contours, to get the residues at i?i,i?2 = 0, see 
appendix A2: 



dt^j- w{t,T)F{t)G{T)dT = j- dt^f- w{t,T)G{t)F{T)dT (6.12) 

— oo „ J —pco J — oo „ J —poo 



This identity, and the remark that it is relevant for the cancellations, is a key property and is due to L. 
Chierchia (private communication). 

If we consider a tree "& with root label j > I and we take t = Ty^, t = t^^, vi being any of the nodes 
immediately following vq, the above identity has a simple graphical interpretation (formally discussed in 
appendix A2). Let = ■■■ ,'&n be all the trees that can be obtained from •& by detaching the root 

branch from the node vq and attaching it to another node Vj with S^. = 1 {i.e. to a node to which the above 
operation gives rise to another of our trees). Then the symmetry (6.12) implies that the TZ value of any of 
the trees thus obtained has the form V ■ i{i^vp)j with V independent of 'dp 

Hence we see the following basic cancellation mechanism: if the root free mode is then g v^j = 
and the sum of the contributions to the homoclinic splitting coming from the above family of trc;es vanishes. 

We conclude, in particular, that the TZ-tree value of the contribution to the action splitting of the resum- 
mation TZ-tree values with ripe fruits only and free mode is always 0, if the root label is j > I. Hence we 
do not consider them in the analysis of the splitting. 

5): (The energy conservation cancellation): 



III) Cancellation of the contributions to from 7?.-trees with free momentum, ripe fruits only, and with ripe fruits carrying 
only the label j = 0, and with all the free branches carrying a j > label, by summing over all the possible ways of attaching 
the fruits to the tree. This cancellation is also remarkable but its role is not very clear; its proof, not immediate, will not be 
given here. 
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We cannot, however, conclude that also the general contribution to the splitting A; of the resummation 

TZ-tree values from trees with free mode is always 0. 

This will not be so bad, after all, as the energy of the two whiskers is the same so that (see (1.1), (2.2)) 
Hn{X'^{t; a) = Efji^= X^;j>q Ehfjp- where, by the KAM results reported in §2, is an analytic function of /U 

near ji = Q and is independent of a (and of course of t and a). Recalling that -?/nj.(0; a) = a, Xju-(0; a) = tt 
and that /°(0) = (0) = 2 Jog we find for h>l: 

w-J?f<^(0) + 2gX^'^(0) + i (^"'^("'"(O) •Xj''"(0)+ 

hi+h2=h 
l<hi <h-l 

+ Jo-iX^i"(0)X^^"(0)) + Shi ^ Ucos{d-u + mr) = Eh 

where X'^'^{0) is short for X'^'^{0;d) and 6ih is the Kronecker symbol (and for /i = 1 the sum over hi is 
absent); recall that a label I has been equivalently denoted + and that the labels j > I are collectively 
denoted j. Taking the difference when a = + and a = — one obtains: 

h-l 

Alid) ={-2g)-'(u-md) + - ^ J-'A'^-'^id) ■ [X\+{0-d) + X\-{0-d)] + 

fc=i 

+ j^'AX-\d)[xl+{o-d) + xX-{o-d)]) 

Hence in some sense the A^ components of the splitting may be "less important" to control. One may also 
think that the energy conservation allows us to "transfer" the cancellations that we have shown to exist in 
the expressions for the rotator components of the homoclinic splitting to the pendulum components. 

B): KAM cancellations. 

The above analysis can be applied also to the case in which the pendulum in (1.1) is decoupled from the 
rotators, to provide a graphical representation of the invariant torus which continues the unperturbed torus 
A = Q. The simplest representation of the torus is: 

a = aeT'-\ A = X^{Q]d) (6.15) 

which is obtained by simply considering the trees d with only "rotators labels" , i. e. with node and branch, 
angle or action, labels j taking values j = 1, . . . , Z — 1 and j = / + 1, . . . , 2Z — 1 (hence excluding the 
uninteresting 0, 1). The root time label is t = 0+ (or t = 0~) as we are computing at time 0). 
The root time t is again taken to be t = 0, and: 

^ /-o 

X^i;{0;d)=f- Fl^^(T)dT, X^ff{Q-d) = Q (6.16) 
J +00 

where Fi is the same function in (6.6), resulting from the resummations (hence, by the remark following 
(6.6) it is the i7-Fourier component, in the a-angles, contribution to (4.14)). This time we only integrate up 
to because we are not looking at the splitting (which, see below, vanishes) but at the value of X^ at f = 0. 

However the (6.16) does not seem to be the most convenient representation, not only from a technical 
viewpoint but also because it does not lead to a simple representation of the motion on the torus. 

A better representation is obtained by remarking that F^{t) corresponds to a solution a solution f — > X(i), 
on the torus and starting at a, and has the form F^{t;d) G M, see (3.4). We shall denote {'il) ,t; d) the 
ftmctions in the representation (3.1) of i^^(t; a) = X^j>Q |j-$i(u5t, i; a). The notation is slightly different from 
that of §2, §3, §4 where argument t in ,t]d) appears in the form x = e"*^^*; (this is irrelevant and we 
shall see that $i(V' , t; a) is in fact t-independent). 

The absence of coupling between rotators and pendulum produces the following great simplifications, with 

respect to the case of the homiclinic splitting. 
0) The +00 in (6.16) can be replaced with —00; i.e. the splitting vanishes. 
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(6.13) 



(6.14) 



1) The functions $i have rotator components vanishing if i > 0, so that the rotator components of F{t; a) can 
be simply written ^olLut^t; a) {i.e. no secular terms ||- arc really present); 

2) The function $o(V' > t\ ^) is t -independent and {ip , t; a) has average over %p . 

The above properties are an immediate inductive consequence of the (6.15), and of (4. 9), (4. 10), see below, 
(or, alternatively, of the KAM theorem). 

One sees, in fact, that the splitting is zero, to all orders of perturbation theory, simply by remarking that 
from (4. 14), (4. 13) we see that there are no traces, in the present case, of the pendulum separatrix motion. 
Therefore is a analytic quasi periodic function with average. Hence is also quasi periodic and 

analytic because the operators X and J~^T do not generate any non quasi periodic nor non analytic, nor 
secular terms when applied to average quasi periodic functions of t. Therefore F^, see (4.14), is still 

analytic, and also quasi periodic in t with average over t, by the cancellation (4.15), etc. Therefore we 
never generate non analytic non quasi periodic terms. In particular X^{0) can be eqiiivalently given by the 
(6.16) with — oo replacing oo: hence the splitting at vanishes (being X'^{0'^) — X'^{0^). This simply means 
that if there is no interaction between the pendulum and the rotators, the whiskers remain degenerate to all 
orders of perturbation theory. 
The above implies that we can write: 

A{t) =I^^{ujt;d), d{t) = a + ujt + J~'^T'^^'f{ojt;d) (6.17) 

where $(V' ; d) is the function F(^ ; a) regarded at fixed a and as a function of tp . 

Remarks: 

1) : We have not invoked here the KAM theorem in describing the above algorithm to build recursively 
the ^'^{ujt; d), i.e. to construct a formal power series describing via (6.17) the motion on the invariant torus 
(which, unless we use the KAM theorem we still have to prove to exist by proving the convergence of the 
series). The (6.17), which coincides with (5.15), is here a well defined formal power series with the coefficients 
defined by the recursive algorithm discussed in connection with (5.15) and recalled above. 

2) : in the present case, in the notations of §4, see comments to (4.1), it is F'^'^{\j; , aoo) = $''(^ ; d) so that 
(4.9) implies <^'i = 0. 

3) : Since the w is a diophantine vector the above points {A{t),d{t)) cover densely the torus as t varies; 
hence calling ut = and setting a = and $(V' ) = ^(V* i 0)) we realize that a natural parametrization of 
the torus is: 

oo oc 

fe=i fc=i 

- r - r 6-18 

^ ' luj -v ^ ' (iw • vY 

and in such representation ip has the interpretation of "average anomaly", i.e. the evolution is simply 
ip ^ ip + ujt. By (4.9) $g = 0, and the u = terms are therefore absent (care has to be exercised here in 

not confusing the Fourier transorm with respect to tp and that with respect to a: we are not talking about 
the latter as a is set = 0). Since we set a = we could use the parity properties of which, if a were not 
0, would be jointly odd in d,ijj , reflecting that F^ is time angle odd and implying that if a = then it is 

odd in tp and hence <&g = 0. But <I>g is equal to also if a 7^ (), by the cancellation remarked in (4.2) (see 
(4.9); see also remark 2) above), which is not a parity cancellation. 

4) : To calculate F^ at a = 0, hence to calculate $ at a = (), we can use the tree expansion. The <i>^- can 
be identified to be given by the trees of order h with time label t„(, of the first node fixed equal to the root 
time label t, and with jy^ = j and total free momentum H. We use the remark 2) after (5.14) to establish 
the connection between F^ and the tree expansion. 

In fact, if we remark that: 



dr' = -^-^ (6.19) 



we see that a resummation tree with free momentum j7 has a -dependence proportional to e^'^ '^'^^o . Since 
the trees are evaluated stopping short of computing the "last integral" over the Ty^ which is, instead, set equal 
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to t we see that the tree value for the evaluation of e""'^ is then obtained by replacing such exponentials 
in t by e^'^''^ : summing over the trees yields e^'^"^' <§'l. In the above calculations a is set = 0. 

5) : since the splitting is identically zero to all orders there will be no resummation trees with dry fruits. 

6) : thus we se that we can use the formalism developed in the case of the theory of the splitting, to study 
X'' for a arbitrary. 

But to take advantage of the easy evaluation of the integrals and of the fact that we only want a = in the 

present case, it is useful to quickly go through an essentially independent and more detailed analysis. We 
can start by remarking that the integrals associated with the tree vertices are straightforward because: 



/ (t - r)^ e--^- dr- {t - r)^ e^-^^ dr = Z - C 7-^^71X7, C = 0, 1 

J (a) J (a) {lUJ-vY+i {lUJ-vY+i 



(6.20) 



which takes the place of (6.3) and (6.5). 

We can mimick, therefore, the procedure followed in the resummation scheme developed for the homoclinic 
splitting cancellations. 

Given a tree d contributing to order h we represent the £^ operations associated with a node v by affixing 
a "bubble" around the node v and enclosing all the branches following it, or by writing a label TZ on the 
node V. The bubble will mean that after evaluating the J~^I or I operation via (6.20) we chose the second 
term in the r.h.s. of (6.20) (with = 1). If the node is not enclosed in a bubble that cuts the branch linking 
it to the previous node, then this means that it is marked by TZ and that in the sum in (6.20) we selected 
instead the first term (like in the previous case). 

We can repeat the same representation operation on the (improper) integrations pertaining to all the other 

nodes, starting from the highest on the tree and going down towards the root: we stop at the first nodes 
following uo, (the latter node is somewhat different from the others as the corresponding I operation will 
not be performed, as we evaluate and not X^). 

Thus the most general contribution by trees of order h to will be represented by a tree with h nodes with 
index 5 = 1 (and h branches, because now there are no nodes of order 5 = 0) and all the labels introduced so 
far iJZ labels included) plus an arbitrary number of bubbles drawn around tree nodes above vq, and drawn 
as described (avoiding overlappings) . 

We call free the branches that are not enclosed inside bubbles of any type. The total free momentum of the 
tree will be the sum of the node modes associated with the free nodes. By construction the t dependence of 
the tree value appears only through e'""*, if v is the total free momentum of the tree. This is special for 
the present case, and it does not hold for the homoclinic case. 

Hence the v-th Fourier coefficients, in , of ^'^{tp ) are simply equal to the sum of the tree values of the 

trees with order k and total free momentum V, evaluated att = 0. They will be denoted, as usual, This 

also provides an interesting interpretation of the free momenta. Note that in this way the concept of total 
momentum (as opposed to that of total free momentum) does not even arise. 

Looking at (6.20) we see that the second term is a constant factor which can be taken out of the integration 

_._2 

operations J T associated with the other tree nodes. Thus the trees with bubbles involve somewhat 
simpler integrals (although not dramatically simpler as in the case of the homoclinic splitting, because in 
this case all the integrals are essentially trivial). 
Consider a bubble containing a subtree of order h linked to its root node v' > by a branch v'v carrying 
a total free momentum i/ = Vf{v) = X]i^>„^V), where the * means that the sum is over the nodes which 
are inside the bubble but not inside inner bubbles, i.e. which are "relatively free"), and carrying a label j 
(1 < j < Z — 1) , and a label h equal to the order. Fixed j7, j, h we can sum over the possible choices (consistent 
with the labels z7, j, h) of the subtree and its decorations obtaining the "resummed bubble values" : 

Pvj = -TrJ-^^%, i7?^0 (6.21) 

The cancellation remarked after (4.14), see also (6.18), implies that (fi. = and (6.21) is interpreted as 
for 17=0. In fact it could be easily checked by parity considerations the more general relation: 

I3^j=-p1^^, j = l,...,l-l (6.22) 
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Therefore the above resummation suggests introducing again "resummation trees". A resummation tree, 
with all its decorations (bubbles included), is defined by drawing a tree with all its decorations and by 
deleting the contents of the bubbles, leaving only the branches connecting the top free nodes to the nodes v 
inside the outermost bubbles if any: we call Vf the set of such "non free" nodes. We shall leave the bubble 
around each v, and on the branch leading to v we write the label jx^ , together with the total free momentum 
and the order of the deleted subtree. 

By construction the resummation trees have bubbles which can contain only one node. It is natural to call 

such resummation bubbles "fruits", as in the previous case. And the nodes v G Vj- wc again shall rightly call 
seeds, see remark following (6.10). Note that also from the new viewpoint there is only one type of fruits. 
Setting /i7^o = fa and calling j7/(v) = J2v<w<Vf the free momentum of the node v, the "value" of a 
resummation tree contribution to $1^- will be simply: 

vo<viVf v£Vf 

by (5.12), as in this case riy = Q, Sy = 1. The value of F^p is obtained by summing over all trees i? with order 
h and total free momentum v: naturally the order h is the number of free nodes plus the orders of the fruits, 
but the total momentum is the sum of the node modes excluding the modes of the fruits. 

Finally wc renumber such trees (as described at the beginning of the section) according to their topological 
structure (i.e. resummation trees do not inherit the numbering from the trees that generated them). 

This is a good definition as we can now say that the sum of the values of the decorated resummation trees 
of given order is equal to the sum of the values of the previously considered trees of the same order. As in 
the homoclinic splitting case the correctness of the combinatorics is not difficult to verify if the trees are 
regarded (as we are doing) as having all the branches distinct. 

Remarks: 

i): The collection of all the trees of a given order and total free momentum must give a zero contribution to 
because {ip ; 0) has zero average by (4.9) , or (6.22) . Hence Vf{v) can, and will, be supposed without 
fear of errors and without ever having to divide by zero in the trees evaluations (recall the diophantine 
property (1.3), showing that zeros can only occur if Vf{v) = 0): the terms in (6.23) with some of the 
denominators equal to have to be regarded as 0. 

ii) Another essential cancellation is the following. Given a tree suppose it to be such that Vf{v) = Vf{v'), 
(^ 0), with v' ,v being a pair of consecutive nodes in ^, for some v > vq- Then the v' must be a node 
with a bifurcation with at least two branches (as, of course, i/y ^ for all v). We can then consider the 
subtree ??2 C t9, with the same root as -d, obtained by deleting the branch v'v and the following ones. Since 
yf{v') = yf{v) by assumption, such tree will be such that Yliw>v' '^^ ~ ^■ 

with root v': imagine to attach it to the remaining tree by pinning it to the nodes w > v' . Prom (6.23) it 

is clear that in so doing we get terms equal to some w- independent constant times ii/wj' (if.?' is the branch 
label of v'v) provided we could neglect that, in so doing, some of the denominators of a few of the branches 
above v' change value. The change in value is simply the addition or subtraction of e = uji'{v). Hence this 
can be hoped to be a good approximation if s is very small. In fact summing also over a simoultanous change 
in sign of all the modes i'w to which the pinning can be done it is clear that the sum of the values of the 
considered family of trees vanishes to second order in e. 
Hi): The same argument holds if v and v' are comparable in the partial order established by the tree, i.e. if 
v' < V but v' is not necessarily the immediate predecessor of v. 

Therefore we can approximately compute X!^{0;d) by summing only trees such that: 
I) Vfiv) ^0 if V is any node (seeds included). 

II) i'f{v) ^ i^f{v') for all pairs of comparable nodes v',v, (not necessarily next to each other in the tree order, 
however), with v' > vq. 

The consequences of the above proved cancellations are analyzed in the next sections (and in the relative 
appendices). 



29 



§7 Twistless tori: Siegel— Brjiuno— Eliasson method for KAM tori. 



This section has heuristic nature; we shall suppose that X\''\6;a) can be computed by summing over 
the trees verifying I), II) at the end of §6. The corrextions, i.e. the contribution from the other trees are 
accounted for in the Appendix A3, because the natural continuation of the heuristic argument is §8, while 
the corrections to the approximation II) require some new ideas (constituting Eliasson's main contribution). 
It will be clear that the approximation II) is very good, particularly if one applies it to momenta iy{v) which 
arc very large. 

To simplify further the proof we shall make a further assumption on Wq besides (1.3); namely we shall 
suppose that: 

min |Co|wo • - 7^1 > 7"+^ if n < 0, < |i7| < (7"+^)"^"' (7.1) 

0>p>n 

and it is easy to see that the strongly diophantine vectors, as we shall call the lSq verifying (1.3) and (7.1), 
have full measure in i?' if 7 > 1 and c are fixed and if r is fixed t > l—l: we take 7 = 2, c = 3 for simplicity; 
note that (7.1) is empty if n > —3 or p < n + 3. 

We proceed to prove theorem 1, §1: i.e. the persistence of the torus which for fi = is A — 0. Wc recall 
the hypothesis H: hence, in this section, the parameter 77 = 1 and therefore the system time scales arc set 
by IcjIjCq"^ and by the perturbation; and J is an arbitrary diagonal matrix with matrix elements Jj > Jq 
where Jq > is a positive constant. 

The basic idea of the following proof goes back to Siegel, [S] : his somewhat difficult proof has been greatly 
clarified by Brjuno and a very clear exposition and generalization of Brjiuno's work by Poschel is in [P]. But 
the connection with the KAM theorem and the tree cancellations, exposed in §6, are due to Eliasson, [E]. 

Let Af be the set of the harmonics v for which fp = fo^p in (1.1) docs not vanish: the number of such 
"perturbation harmonics" does not exceed {2N + 1)'~^ and the length |i7| = Yl Wj\ is < But the long 
sought connection with the KAM theorem is due entirely to Eliasson, [E] , who also realized the role of the 
tree expansions in establishing the connection just mentioned. 

Consider the functions defined in §6, B), see (6.18). We shall prove that: 

and D,B will be determined below, and depend only on .Jq, N,1,Cq,t,\uj\. Hence they do not depend on 
a lower bound on det J~^. Furthermore i| p = () if li^l > Nh, because the initial perturbation has only 
harmonics with < N. 

The discussion of §6, B), shows that (7.2) implies theorem 1, if valid under the just stated conditions, and 
the function x can be taken Xjy = -^^v- 

Consider a tree O with labels and fruits and regard it as representing a contribution to the l.h.s. in (7.2). 

We imagine to express each fruit value as a s\im over the trees that can be grown on its seeds, and so on. 
We can think that "each seed is magnified to show its content in subtrees", see remark following (6.10). In 
other words we undo, for the purposes of the estimates, the collection of contributions to which gave 
rise to the resummations and to the notion of fruits. 

But we take into account the assumed cancellations by constraining, from now on, the labeled trees appearing 
inside each of the bubbles (representing a magnified seed) to verify the properties I) (which is exact), and II) 

(which is approximaied) at the end of ^6. 

Therefore the contribution from the considered 9, and as a consequence the full value of ^'ij, is split as a 
sum of contributions coming from trees carrying an arbitrary number of bubbles, hierarchically ordered to 
avoid overlappings and each of which encloses a node of "& as well as all the subsequent ones. The number 
of nodes is of course h. 

Having undone all our painful collection of the trees contributing to ^'ij it is convenient to regard again all 
the branches of the trees as distinct and therefore the correct combinatorial factor with which to multiply 
the tree contribution to the is again h\~^, see (5.11). 

Fixed h and an unlabeled tree the number of ways one can put bubbles around the nodes is bounded by 

2^ (as there can be at most one bubble per node by our decoration rules); the number of node modes is 
bounded by (2A^ + l)('~i)''; the number of branch labels is {I — 1)'* and the number of trees is, by Cayley's 
count, see §5, footnote ^. Therefore a bound on bp- is simply: 

2^(2iV + l)('-i)''(? - 1)''22''M?, (7.3) 
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where M/, is just the maximum possible value that a single fully decorated tree can contribute to b^-. 

Suppose that in one of the possible trees i'} there arc n bubbles, counting among them also an extra bubble 
(drawn for convenience) and enclosing the whole tree except the root, and suppose that the i-th of them 
encloses h* branches. Then h = n + J2^=i K- because, (having added one bubble encircling the whole tree 
but the root), for every bubble there is one branch not entirely contained inside a bubble. 

Suppose that we can prove that for a fruitless tree the contribution to b^j is bounded by DqBq'^, for all 
fc > 1. Then it clear from (6.23) that one can take: 

Mh = laa^Uti DoB'o' (7.4) 

where the maximum is over n, h* such that h = n + X]"=i h^. Supposing that > Dq it follows immediately 
that (7.2) holds and that one can take (see (7.3)): 

Z? = (23(27V+l)'-i(l-l))-Do, B = Bo (7.5) 

Therefore the problem is reduced to the analysis of a fruitless tree. Hence we only must study: 

5,j,i7 = max* TT , (7.6) 
■& \Coijj-Vf{v)Y 

V>Vo J \ / \ 

where the maximum is considered over all fruitless trees with total free mode /7 and the * reminds us that 
only trees with the considered constraints are admmissible to compete for the maximum. The constant Cq 
is introduced to make 5h^p adimensional. 

Proceeding as in [P] , we fix, given j7, h, one tree ?9 on which the maximum is attained and we consider its 
mode labels and the subtrees with first node at v and root at v' , for any v > Vq. It is necessarily true 
that the subtree 'dy with the given mode labels is also a maximizer for 5h^,p{v)- 

A first remark, that should be at this point really obvious, is that we can suppose that < |i7 (u)| < h^N 
if hy is the order of the subtree with root at v' and first node at v. And therefore we can always suppose: 

\Coa-Vf{v)\>j^^ (7.7) 

by the diophantine assumption on uj. 

Given, arbitrarily, q>l integer we say that a harmonic v is q-resonant if: 

Co\o^--\<l^^C{<l) (7.8) 

where (^(q) is defined here. 

Then we define N^i/, h; q) to be the number of q'-resonant harmonics among the harmonics found in the 
tree where the maximum in (7.6) is reached. 
The following (extension) of Brjuno's lemma (see [P]) holds: 

N{v,h;q)<2^ (7.9) 

for the trees verifying I) ,11) at the end of §6. We do not repeat the proof as the present analysis has only 
heuristic value. 

Assuming (7.7) we can easily conclude the discussion; in fact choosing q = 2^: 

with Bi = cMEp=o 4? log(3Af"2-P). 

Therefore we can bound (7.6) by B'l- Recalling also that in the notations of §1 JiJp is also called E and 
it is supposed to be an upper bound on see comment after (1.1) and (1.4), and if = ^-^p^^, this 
means that Dq,Bo in (7.4) holds with: 

Z)o = {Clu^){BiB5), Bo = B^Br, (7.11) 
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And therefore (7.2) holds with: 



Bo = bi-^ Q-'Cl iv8-+i-i28-, Do = 6;Bo (7.12) 

where 6^, h\ are suitable constants depending only on I and h\ can be supposed without loss of generality > 1. 
Hence wc sec that B in (7.2) docs not depend on Jmax on Jmin- 

We see that if Jmin = +oo then 5=0: and the system is in fact completely integrable as already remarked 
in §4, around (4.16), (see also [Gl], probles 1,16,17 of §5.11). 

Clearly the bound (7.2) implies the convergence of our approximation to the scries for the invariant tori for 
< iJ~^: this gives an idea of the connection between Siegel's theorem and the proof of the KAM theorem. 
This was the situation before Eliasson's work. To complete the proof we must turn to the corrections present 
because the cancellation II) at the end of §6 is only approximate. A complete analysis is in Appendix A3 
and A4. 



§8 Theory of the homoclinic splitting. 

As a consequence of the above analysis wc get that, in general, the angles of homoclinic splitting (or 5, see 
(1.8)) are smaller than any power in t]. Let us denote A|i the coefficient of order h in the Taylor expansion in 
powers of and of order v in the Fourier expansion in a. of the splitting (^u, a) A(q?) = X+(0; (x)—X~ (0; a); 
then the property of smallness is an immediate consequence of the following bounds. 

Let d e (0, 1), and let: 

eh = eh{d)= sup e-l'^-'^°ls"'(t-'^), /? = 4(7Vo + 1), p = 4t (8.1) 

0<|!7o|<JV/i 

where A^o is the maximal (^-harmonic of the perturbation / in (1.1). Then, for j = 0, . . . , Z — 1 and for all 

J G [Jo, +oo) and h>\: 

|A^,,.|< Joffi)(677«)-\ \^%^^\<J^gDd-\Bd-<'f-\h-\)^Pe^ (8.2) 

where h < 1,D,Q,B are suitable dimensionlcss constants depending on the various parameters describing 
(1.1), but not on the perturbation parameters r],n; the proof in the appendix Al shows how to construct a 
bound on B; the latter is the only real problem as the existence of the b,Q,D is well known, (see §1 and 
[CG], for instance, §5, cq. (5.76), (5.78)). The constant Q has been briefly discussed in §1 and we take it 
to be the minimal such that the properties of peristence of the tori hold {e.g. Q = 10 if I > 2 and Q = 1 if 
I = 2, see §1). 

If I = 2 both bounds in (8.2) are uniform in J > Jq and one can take J +oo; furthermore r can be taken 
T = 0. If I > 2 the second bound in (8.2) is still uniform in J > Jq, but the first may fail to be uniform in 
the values Jj > Jq. Perturbation theory in fx is well defined to all orders, but it might be non uniformly (as 
the Jj 00, i.e. as the twist rate goes to 0) convergent even at small ji; see, however, the conjecture at the 
end of appendix Al. 

The bound (8.2) is of great interest in the case Z = 2 as it is sufficient to determine the exact asymptotic 
behaviour as 77 ^ of the splitting, (hence in the periodically forced pendulum, by taking J = +00, a 
permissible choice). Note that, if I = 2, it is eh = £1. 

We proceed to explain the strategy of our bounds. The first inequality in (8.2) is just (2.6). Note also that 
since we always suppose that / is a trigonometric polynomial of degree A'', it is actually A^^ = if > Nh. 

To explain the appearance of the "small" factor Eh in (8.2), let us consider the most general resiunmation 
tree, 1?, of order h with all its labels (see item 3) in part A) of §6), representing one contribution to the 
homoclinic splitting A^^.. 

Let r be the root of d; let vq be the first node after the root; let Vf be the set of top free nodes, i.e. the set 
of top nodes still outside the fruits (if any) of and recall that cannot be surrounded by a fruit. Let Vf 
be the set of top nodes contained inside a fruit {Vf might be empty): i.e. Vf is the set of fruit seeds. 
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Recall that in general /ia„ denote the order of the subtree following v: the fruits contribute to the order a 
quantity given by the order label /ia,, leading to them. We shall call Vi, . . . ,Vm^ the nodes following v, 
Vo < V <Vf, and v' will be the node preceding v; my = ii v G Vf. 

With the above notations it is easy to "write" the sum over a of the values of the contributions to /3^'7 
from the resummation trees ^? of order h and total mode P and root label j = jyg > I: 



vo<v<Vj ' 



vo<v<Vf ^gV/ 

where (see (5.12), (3.4), (5.13), (5.14), (6.6), (6.9), (6.10)), and the dimensionless coefficients c^^ are given 
by: = {Jg'^)~^U^Sy6jjto,i + {Jog'^)~^ fi^Sy6j=o,i - (1 - Sy) where 6j=o,i is 1 if j = 0,Z and otherwise 
and a similar, complementary, meaning is given to 5jjio,l (recall also that if = then i/y = 0) and: 

yj (t t)= i ^oo(i)wo/(T) - lBoi{t)woo{T), V>Vo ,jy=l 
' ^ ~ \ g{t-T), v>vo ,jy >l 

( awQi (r), if j = / and 7=1 

agr, if j > Z and 7 = 1 ^7 ^ f -^l 1^7 = 2 

^'^oo('''), ifj = Zand7 = 2 ^ \ if 7 = 1 

.1, if j > ^ and 7 = 2 



xVr) 



where the dimensionless matrix element iuoi was defined after (6.6); 7 = 7„ is 1 or 2, respectively, if the fruit 
encircling v is dry or ripe; m is the total number of branches (root included); the integers decompose 
m,, and coimt the number of branches emerging from v and carrying the labels s = 0, . . . ,1 — 1: Xy, vx^,,jx^ 
in the product over Vf are, respectively, the branch entering the fruit around v, the momentum carried by 
and its j-label. 

Remark: the Ty integrals in (8.3) are, in general, not convergent for Ry small: therefore the improper integral 
symbols -f are used and they have to be thought as the analytic continuation in Ry from ReRy > and 
large. 

If wo do not perform the Ry integrals in the above (8.3), we sec that the result of the improper integrals 
is a holomorphic function of Ry which can be continued to the region of positive and large Rei?t,. At such 
large Ry the integrands are well decaying as Rct„ ±00. 

Suppose, furthermore, that the tree contains only ripe fruits and represents a contribution to the splitting, 
{i.e. jyg = 1), (the case of no fruits being in our notations a special case of only ripe fruits). Then the 
integrals are holomorphic functions of the t„ in the strip g\ ImTt,| < ^ — d, except for the factors e~'''"^^^'^'" 
and provided the (7«)^gy^. , are all equal to 2, and provided 7^,;, = 1. 

The residues have to be evaluated starting with those relative to the top free nodes (and in arbitrary order) 
and continuing hierarchically, {i.e. always evaluating the residues relative to the highest remaining nodes, in 

arbitrary order). 

Note that the evaluation of contributions from trees with dry fruits as well as from trees with 7„(, = 2 
{i.e. to values of ripe fruits) involve, instead, integrands with singularities at Ref = see (6.9). It is a fact, 
unfortunately, that the interplay between 7 = 1 and 7 = 2 might look, at first, confusing. 

If the tree has 7^^ = 1 and 7^ = 2 for all v €Vf, then its value (8.3), i.e. its contribution to is such 
that we shift the integrations over the gTy to the branches with fixed imaginary part ^ = ±(^ — d) choosing 
the sign as we like, by using the shift of contour formulae (3.15). 

This implies that, if V^f is the total free momentum of our tree, we can compute (8.3) as: 
^^^•^^^ J 2^/ dgTy,e-'^^oR^oi9r.,+'i)^^.^ry, + ^^-i^). 
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vo<v<Vf 



if all the jv, v G Vj, are equal to 2 and if = 1 (otherwise the x'^^ or the; ^I'J"" arc not holomorphic in the 
strip and the argument docs not apply). The factor e~^^ ^k-i^ol jg extracted, after shifting the contour up 
or down according to the sign oi lo ■ i^o, from the factors ^^'^-'^i'^v+tg 0_ 

The factor e~^^ I^j-voI qq^^ bounded by Sh, provided i/q 0: the latter case i^o = can indeed be; but we 

have proved that the cancellations I) show that it gives in fact exactly contribution to the total splitting, 
after summing its value for all the fruitless trees. Thus the presence of the factor Sh is explained, to all 
orders, in the contributions coming from the (sums) of tree values of trees with only ripe fruits and Po ^ 
or from trees with no fruit at all and any (equal to or not). 

The case of trees with only ripe fruits and free momentum is also giving a zero contribution to the splitting 
if the root label is j > I, by the cancellations II) and IV) of §6. It remains the case in which the root label 
is j = I: this case, however, is covered by the energy cancellation of §6, once we understand why the small 
factor Sh appears in the bound of the splitting to order h in the cases j > I. 

But the bound (8.2) holds by direct calculation for h = 1; therefore one assumes it inductively for h < k. It 
is clear that to order k the trees with at least one dry fruit will have a value containing as a factor the value 
of the splitting to some lower order, in turn bounded by a quantity containing a small factor e;, < Sk, by 
the inductive assumption. Hence we just have to worry about the trees with only ripe fruits (no fruit at all 
being a special case): but they are the ones for which the above analyticity argument is correct. Henc;e the 
factor Sk is present in the bounds of the contributions to the splitting from the trees of order k and either 
with total free momentum or with no fruits or with at least one dry fruit. 

The contribution to the splitting from the trees with ripe fruits only and total free momentum also gives, 
by the above reasons, contribution to the splitting for the t components which are therefore, to order 
k bounded proportionally to Sk- hence also the +,i.e. the j = 1-component of the splitting is bounded 
proportionally to Sk- 
it remains to estimate the integrals: but this can be done rather easily from the analysis of integrals like 
(8.3) or (8.5) performed with the aim of finding bounds which do not exhibit the small factors Sh (which we 
already know have to be there). In fact such integrals can be essentially exactly computed. 
A full discussion of the straightforward but lenghty analysis is reported, for completeness, in appendix A. 
We proceed to show that the bounds (8.2) {i.e. ultimately the proved canellations) and the proved cancel- 
lations = imply that the splitting is smaller than any power. 
By (8.2), the splitting can be bounded, for any multiindex a, by: 



1-1 



oo 



\diA^{0)\ < DJogY, E \l^\''W\^^^^i^{{bv'^r\Bheh{d)} (8.6) 

h=l 0<|i7|<Af/i 

having denoted = B^~^d~^^{h — 1)!^. Note that, if N is the trigonometric degree of the polynomial 
/ in (1.1), the sums over u can be suppressed by multiplying the /i-th term by the mode counting factor 
(27V + l)'^('-i)+l"l (i.e. the maximum number of non zero Fourier components times the maximum of \v\°'). 
From this bound it follows that |9^A|| is smaller than any power in -q (see (1-2)), a result that, as mentioned 
above after (1.8), also follows from the theory of normal forms ([Nei]). In fact we can split the sum over h 
in (8.6) into a finite sum, X^i</j<;j(,(-) and a "remainder", '}2h^h^{-); then, if < ^bvp and jj is small we 
find: 

E(-)^^^o.E(S)''^2I.^o.(^)^° (8.7) 

h>hQ h>ho 

where C = {2N + l)('-i)+l'*l takes into account the summation over the {2N + 1)'~^ modes for which 
does not vanish; and: 

ho 

E(-) ^ DJogho\^^\C''"d-^''"B''"-\ho - ly.^Shoid) (8.8) 

h=l 
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Thus if /It = vp^^, d = sjr\, and s > 1 we see that fixing /iq = r/s, for any r > 1, the |95A^| is bounded by a 
(r-dcpcndcnt) constant times rf (as in such a case (8.8) is just a remainder, exponentially small in r\~^l'^^. 

Note that when I = 2, the splitting 6 introduced in (1.8) is just 9aA| and \a\ = 1 in (8.6). 

In the case I = 2 (too > 0) we can get more precise asymptotic estimates: namely we can prove the well 
known exponential decay of the splitting, as e~'^'^°l'^^^ , for jj — > when |/u| < 0{rfi) with Q large enough, 
see the theorem 2 in §1, and below. 

To check the latter claim on the asymptotic value of the splitting we simply remark that (8.2) imply the 

convergence of the scries for the splitting for < B~^rj^^'^°~^^^ (recall that d = ^Jr\. Thus the second and 
higher order terms can be simply bounded by: 

-^^o.i)i:(-|^)', if |Ml<^r,-W) (8.9) 
because, if ? = 2, it is eh = £i- 

Hence we compute explicitly the first order term. We find (computing from (4.5)), with the above notations, 
that the first order term 5i = 9qA|(0) is given by: 



l"l<N 
l">l<No 



/oo 
[sin wnf sin m(^° — cosa;nt(cosm<^° — l)]dt (8.10) 
-oo 



For all J < +00. Note that J = +oo is, if Z = 2, possible. 

A simple calculation shows that the leading order (easily studied in terms of the elementarily expressible 
auxiliary functions: ^lunt^^mvp (t) _ jg given, as a; = Wo/\/?? — * oo, by the terms in (8.10) with 
m = ±1, n = ±No and it is: 

,--o+iA,e--M , A. ^ -,-(^)-°-(/,„,, (8.11) 

g (zJiVo — ij! 

provided ^0. 

Thus we see that the first order dominates if: 



which means < 2j^gD ^'^^°~^^ implied by < rj^ with Q — 3iVo + | and r/ small enough. 

Comments: 

1) A theory for the latter case, in a somewhat different set up^, can also be found in [DS], [Ge], [GLT], 
[HMS], [La], based on arguments not relying directly on the cancellations between the contributions to the 

coefficients of the perturbation series. 

2) Note that a value ^ in (8.1) would not spoil the asymptotic formula (1.9), if / = 2: it is easy to check 
that it would simply make it valid for Q > A^o + | + P- 

3) The above estimates can be obviously adapted to prove that the quantities ■, obtained by looking at 

the definition (6.6) of (3^'^ but replacing the cr by 1 inside the integrals, also verify the bounds (8.2), with 
different constants. Note, in fact, that the replacement of cr by 1 makes the integrands, in the contribution 
to a^ j coming from trees with only ripe fruits, analytic thereby making it possible to imitate the above shift 
of contour argument. 

In [CG] the notion of homoclinic scattering, and of scattering phase shift function (j{d), was introduced and 
it could be checked that the j have the interpretation of Fourier transform of the scattering phase shift 
function a {a): we do not give a formal proof of the latter identification. 



In [DS], [Ge], [HMS] a forced pendulum ip + r] sin (f = jii^sint is considered. After suitable rescalings, such equation arises 

as Hamilton equation for the hamiltonian + ^ cosi^ — iJ-ipsina, to which our techniques seems to be adaptable. The 

adaptation is needed because the latter hamiltonian is not periodic in tp, a property used explicitly in our analysis. In [HMS], 
[Ge] the exponential smallness of the splitting is obtained under the assumption that ]/i] < const. with q given, respectively, 
by 4 and 5/2; in [DS] a is any positive number. In [La], [GLT] the splitting of separatices for the so-called standard map is 
considered. Our technique seems to be adaptable to cover also such a case and would confirm the exponentially small value of 
the separatrix splitting found in [La] and [GLT] (see formula (6) in [GLT]). 
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Appendix Al: Estimates. A conjecture. 



Our purpose is to study the integral (8.3) and (8.5). The estimates are straightforward as we are wihing to 
tolerate for I > 2, bounds proportional to a power, (any), of the factorial of the order k and to an inverse 
power, (any), of d'^ . The only really non trivial point being the understanding of the cancellations in §6 
which, by the shift of contour argument of §8, provide the small factor e^. We provide the estimates only 
for completeness, as the results that we are deriving have often been presented in an incomplete form. 

The reader will realize that the following "estimates" arc in fact an essentially exact calculation of the 
integrals involved. Such a feat is made possible by the simplicity of the model 2) in (1.1). 

Let 1? be a resummation tree with m = ruf + mo nodes, m/ of which are endnodes v contained in some 
fruit {irif > 0). We first consider the (8.3), (8.5) with = 1 and with the m! replaced by 1. 

Such case contains the case of fruitless trees: once understood one obtains the general case quite easily; see 
below. 

To simplify the algebra wc split (8.3) and (8.5) as a sum of many (c™ for some large c, sec below) terms. 
The splitting is rather trivial and it is performed with the aim of being left only with integrals of functions 
which are products of single argument functions. Recall that d = -I ± ^, sec (8.5). 

1) each w{t, t') is spHt into two addends, by using the expression (8.4), i.e. woi{t) woo{t') — woo{t) woi{t') 
is regarded as a difference of two terms, and so is r — r'. This splits each of (8.3), (8.5) into up to 2"* terms 
(in the estimates we sometimes exceed to get simpler expressions: here we bound mo by m, for instance). 
Hence we produce up to 2™ terms. 

2) each woiit) = '2{gt) {cosh gt)~^ — 2sinh5i is split into its two composing addends. Hence we produce, for 
each of the preceding ones, up to 2™ more terms. 

3) in the case of (8.5) we split the qtv + coming from the action angle elements of the wronskians, or 
from woi, into the two addends composing them, thereby producing up to 2™ more terms for each (see 2)) 
preceding ones. 

4) wc split into two addends getting a factor 2"™" and up to 2"'" < 2"* more terms (for each of 
the preceding), and 2~"^° will compensate in the bounds the 2"*° that we get by extracting the factor 2 
appearing in 2) above. Note however that it is not important to keep track of powers of to: we are doing 
the counting just to help the reader checking what we are doing. 

The integral is thus split as a sum of up to 2^™ terms each of which has the following form {Uy = Q ■ V^): 

where the terms in square brackets are present only if we are condidering (8.5) and the yJ(T) are elements 
of a finite set of functions: namely, in the case of (8.3), e*"^M with n = 0,±1, ±A^o if is the 
trigonometric degree of / in (1.1), and {coshgt)~^ , smhgt, see (2.15). Here /9„ = ± and it is not the sign 
of Ty, but it is an independent variable. The factor 2~"^°, found in step 5) above, is not included in (Al.l) 
and it will be brought back later, (see (A1.2) below). The product over v is over the mo "free nodes" of t?, 
i.e. which correspond to actual integration operations. And t^^ = 0. 

Likewise, in the case of (8.5), the (r) are to be found among the e*"'''('^+'f with n = 0, ±1, . . . , ±No, 
and {cosh{gT + iO)~^, smh{gt + i^), and, as said above, the terms in square brackets have to considered 
present. 

Furthermore Uy < my + 1 is the number of factors Ty that are collected from the wronskians in performing 
the above operations. But J^y < m, because each node v can contain at most one factor Ty coming from 
the wronskians (recall that only W()i{t) and Wij with < i < 1,1 < j < 21 — 1 can contain r explicitly). 

The functions yj are holomorphic for | Imgij < ^ — ^ or, if thought as functions oi x — 6^°^^* , a = signi, 
arc holomorphic for |a;| < 1 and outside the cone with half opening ^ centered at the origin and symmetric 
about the imaginary axis (the case ^ = refers to (8.3)). Furthermore they are in the class Ai introduced 
in §2, §3: sec (2.15) and (3.3). 

The contour integrals involving the Ry are over a small circle, around the origin, and they really denote the 

evaluation of the appropriate residues, see §8. 

Note, as this will be quite relevant below, that the y functions, as functions of x at fixed c arc holomorphic 
near x = ±1, as they only have polar singularities at x = ±i or at ±e'^ 

We reconstruct (8.3), from the integrals (Al.l), as a linear combination of the above intergals, (up to 2"*™, 
as one can check by keeping track of the above accounted proliferation of the terms), times suitable factors. 
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all bounded by: 

and A'' is the maximum degree of the trigonometric polynomial / in (1.1) and F > 1 is a bound on the 
Fourier components of { Ja!f')~^ f ■ The factors bounded by powers of ^ (< mo) coming from the binomial 
expansions met in the splittings considered in 3) have been bounded by ^ to simplify the notation. 

5) We consider one of the nodes v following vq and split the integration over Ty, i.e. 'f^p^'^dT^, as a sum of 

if p oo ^p"° oo ' leaving for later consideration the first two choices. And we repeat the procedure, 
generating up to 3™ terms: all of them left for future consideration except the one which has all the 
variables integrated in the interval [0, py^oo), i.e. all having the same sign. We shall return to the terms left 
out for future consideration in item 8) below. 
We are thus left with an integral like: 

/r' niu + l 

^ dr4e«"'«]e«"^"9e*'^"^''(5T,)"'' [] yj(r,) (A1.3) 

where the terms in square brackets are present only in the case (8.5) and p^^oo has been supposed, to fix 
the ideas, to be — oo. As before the term in square brackets is present only in the case of (8.5). 
Of course one might think that we are undoing all that was painfully done in §6. And in fact this is 
essentially the case; the work of §6 was performed only to exhibit the cancellations. 

We shall let d be f in the case (8.3), while in the case (8.5) it will be actually d (to unify the notation). If 
we now show that (A1.3) can be bounded by: 

|J„|</^^i?r[<^-'''"] max {g-'\a}-u\)-''^ (A1.4) 

0<|!7|<JVm 

we will have shown that all the up to 3™ terms generated by the above decoposition of the above integral are 
bounded by the same quantity. In fact the terms left above "for later consideration" are manifestly products 
of quantities bounded by • . . . • with ^ to^ = m. 

6) We write the first integral in (A1.3) as dr^g + /"^-i dTy^: this gives us two terms. The first of 
which is: 



" •( n [e^'^V^^^'e'^^^^igrX^ JJ yj(r„)) (^1.5) 

... -oo ,.\„._ A — 1 



V>Vq ^ v>vo j=l 



We shall come back to the second integral in item 7) below. 
The functions yir) can be expanded into a series: 



-. oo 



X 
p=0 

and some of the yj^ may vanish, but not necessarily all (because of the possible choice: y(r) = sinhgr). 
If each of the j/j is expanded as in (A1.6), then each of the (Al.l) is broken into a sum over labels {fej}, 
with V & "d, and j = 1, . . . , rriy + 1, with convenient weights, namely: 

n n j/i^" (^1-7) 



of integrals like: 



W r e+^"^^" [e*^-«] e(^-^+^^)-"x^ dgr, {Al. 

^..^M J — OO 



where v' denotes the node immediately preceding v, and the auxiliary paramenters have to be put equal 
to after differentiation while the will have to be integrated over the above small contour around the 
origin. The [e'^^^'] factors are present only in the case of (8.5). 
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The integrals (A1.8) are performed hierarchically. This means that we first integrate with respect to the 

Tt,'s with V being a top node, in arbitrary order. 
To perform such integration we use (3.8): 



X e agr l^^-^J 



valid if the denominator does not vanish, (needless to say), and if < < 0. 

After integrating over the Ty variable corresponding to a top free node v, arbitrarily fixed, we compute the 
Ey derivatives and the residues at i?„ =0, and then we shall repeat the procedure after deleting the top free 

node V and the branches outgoing from it. 

The Ey derivatives can act either on the numerators or on the denominators of (A1.9). The riy differentia- 
tions with respect to Ey give results, if v' denotes the node immediately preceding v, like: 

{ky +Ry+ i5-lW„)"+l " ^ ^ ' 

with n + n' = Uy and the terms like (Al.lO) add up with coefficients ±1 to give the result of the derivative 
of the Ty integral that we are considering. The number of addends is not greater than 2"" (because we are 
taking derivatives of a product of at most two factors, when A„ = 1). Here LUy = lu ■ iTy . 
The residue of times (Al.lO) at i?„ = is simply (Al.lO) itself evaluated at i?„ = if fc„ or g^^ojy are 
different from (because at i?„ = the denominator in (Al.lO) does not vanish being bounded below by 
either \ky\ > 1 or by [^"^a;^!; and otherwise it is: 



n! 



(n + 1) 

still with n + n' = Uy and, as above, the terms with are present only in the bound of (8.5). Developing 
the binomial in (A 1.11) we can say that the result of the derivative and residue evaluation is a sum of at 
most 2"'°+-'^ terms, per each of the already found 3^2""+^, which have the form: 

Gh\xf-e'^"''-'{gTy>f, x' = e'"-'''^'^ (A1.12) 



with h -\- n < ny -\- \ and ujy = Q • Vy\ the latter in general will be, when considering the other integrals 
correponding to inner nodes, the sum of the modes Vy, with w > v. The coefficients G are bounded by 
the maximum between 1 (corresponding to the a bound on the denominator in (Al.lO) when ky ^ 0), or 
(mino<|i7|<7v 1'^ • V\g^^)^''^"^^ (corresponding to ky = 0, w„ 7^ 0) or (77/2)""+^ (corresponding to ky = u)y = 0, 
n = ny,n' = 0, using (Al.ll) and recalling that |^| < |). 

Continuing the procedure we integrate successively over the r variables of all the top free nodes (in arbitrary 
order), each time deleting the considered node and its outgoing branches. 

Therefore after all the integrations have been performed, all derivatives and residues computed, we shall 
have expressed the result of the evaluation of (Al.3) (taking into account the listed proliferation of terms 
described after (A1.4), (3™), after (Al.lO), (n„2""), and before (A1.12), (n«2"-+i)), as a sum of up to 
3'"2^'" terms: we use here that Y^Uy <m as there can be only up to one factor Ty per node. Each of which 
is bounded by: 

e 
e 

because the factors Xy are < in the integration interval, and by (A1.9), eventually, they are cvaluatc^d 
at Xy = e^^; here ci is a suitable constant and in the cases Z > 2 we supposed g/|wo| > 1 and neglect 
the (favourable) ^ in w = ujq/ to simplify the algebra (which is not restrictive unless one cares about 
numeric;ally good estimates, which is not our desire here). In the case I = 2, on the contrary, we use explicitly 
that min-_^jj |a; • i7| = wo/v^' very special for this "non resonant" case. 

The reason why the minimum has to be considered only for \v\ < Nm is that at each integration the 
oscillating exponent will have the form Q ■ v with v being sum of the, at most m, modes present in / in (1.1) 
(and in fact of toq < m) . 



'^v^Se^. max (<7-i|(I;o- i7|)-^" if />2 

^<\V\<Nm [^Al.VS) 

'^".^''^'cj'' • 1 if 1 = 2 
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Collecting all the above terms and estimates and using J2v <m we have a bound on (A1.8): 

AT™ (Cm^)2™ < A^™to!2^ if I > 2 / /i i 1 4^ 

Af^ if / = 2 ^^^■^^> 

for some suitable constants A/j , and if C, r denote respectively the diophantine constant and the diophantine 
exponent of wq, see (1.3). 

The latter bound should be multiplied by the absolute values of the weights (A1.7), by (A1.2) and by the 
"multiplicity" (produced in the course of the analysis 1)%5)) 2^™ in order to produce, after summing over 

the k labels in (A1.7), a bound on (8.3) and on (8.5) with the /?■'■ — 1. 

The weights are Taylor coefficients of a few {i.e. a finite number) of functions of x, with radius of convergence 

Therefore they can be bounded by a common bound Mx on the maxima of such functions in any disk of 
radius A < 1 times A~*. The integrals over the t's run over the interval {—oo,—g~^) so that the Xy are 
always < e~^. Hence, for A = 2~^, we get convergent bounds because of the exponential factors in (A1.13). 
Clearly this was the reason for the splitting of the integrals over the Ty into the part with Ty < —g~^ and 
the part with Ty € [— £("^,0]. 

Note that there are at most (2A'o + 3) different functions y and at most 3m of them appear: and of these 
there will be up to m trigonometric functions of nipQ{T) (in the case of (8.3)) or of nipQ{T + i^) (in the case of 
(8.5)) and up to 2m functions coming from the wronskians. The first admit a bound proportional to dr"^^" 
as the cos ^po{t) and sin</?o(T) carry a polar singularity of second order on the unit circle at distance of order 
d (i.e. ^) from the real axis. The wronskians are bounded proportionally to as they carry, at worst, a 
simple pole on the unit circle. 

Hence we see that a bound on the the products of yj that can be met in the integrals (8.5) has the form 
^m^~2{Na+i)m gQj-^Q constant M*. In the case of (8.3) we can take the bound to have the same form 
with the same A'/* and = ^ , say. 

7) We can therefore switch to considering the "left out part" jlg-i ^Tyg. Let vi be one of the nodes following 

Vq. We break the integral 'f^^'^^dTy-^ as dTy-^ + /^g-i dTy-^ . If we make the first choice and if TOi is the 

number of nodes following vq in the direction vi the first integral can be bounded, by the previous argument, 
by and we arc left with the problem of bounding • j'^g-i dry^ Yiy3i}^ ^'f^oo • ' •' 
We repeat the procedure hierarchically and we are eventually left with up to 2™ integrals like . . ./^^ 
ritie^ /-g'-i '^'''v ... to bound, where is a subtree of with m nodes, and m + ^ mj = m. 
The last integral is manifestly bounded by the maximum of the integrand, which is ^'"(^-^(JVo+i)™, £qj. 
suitable B, because J^^-i dgr = 1, we see that there is a constant B2 such that each of the the up to 3'"2^'" 



terms we generated is bounded by 



-2{No+l)m] 



8) If one selects for consideration any other of the 3™ — 1 integrals left aside in the analysis of item 5), 
see lines preceding (A1.3), one is left with essentially the same problem discussed in items 5)%7) above. In 
fact such choices involve factorized integrals, each of which has exactly the same form as the one studied in 
5)%7) above. 

9) We conclude that the final bound on the sum over the mode values v and over all the other mode labels 
(which is a sum over up up to to {2N + 1)'"* terms) of the absolute values of (8.3) or (8.4) with all the 
/?;;; = 1, is: 

cT-M^ m!^^, in the case of (8.3) 

(Al 15) 

e-|«'3-i7o|rf-2(iVo+i)m^m^m^,2r^ in the casc of (8.5) 

with = ^ — d and C3 conveniently fixed; and we shall use the (Al.15) to get a recursive bound on the 

coefficients 0''\ Furthermore if I = 2 the diophantine exponent t can be taken t = 0. 

The (Al.15) can be used to play the role plaid by the bound on the fruitless trees in the KAM proof in 
§7 in terms of Dq^Bq: in our case the corresponding quantities are given by eiAd^c^d^^^^^°~^^\2h)'^'^ and 
M^c'^d~^'^^^°^^'' {2h)^'^ for trees with order up to h. The extra power of 2 is lost here because to < 2t (recall 
that the order /i of a tree is in general different from the number of nodes because some nodes may carry an 
order label 6y = 0, and h < m < 2h). 

And one has simply to make an argument parallel to that presented in the paragraph containing (7.4) and 
(7.5). The cancellations discussed in §6 show that, among the subtrees that are generated by "magnifying 
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the fruit seeds", there has to be at least one which will provide a small factor e l^'^ '^ol for some v with 

\v\ < Nh. Wc leave the details to the reader, to avoid repetitions of the bounds discussed in §7. 
The final estimate of D, B is that /? = 4(A^o + 1)) P = 4t for the same reason which shows that in (7.5) one 
can take D, B proportional to Dq^Bq. 

Remark: A more careful analysis of the higher orders would show that this can be improved to < t)^ 
with Q > A^o + |) simply because the origin of the 4:Noh was a "poor" bound on the number of factors 

j^^i{nip+v-a) pj-esent in the expressions of the splitting to order h and each of which contributes a d~'^^" , 
(hence eventually a (y^)~^^°): the number of such factors, in deriving (8.2), by 2h. But to order h there 
are, obviously, exactly h such factors, and we can therefore essentially replace 4A^o by 2Ao (changing the 
various constants). 

If J = +00 the above analysis can be repeated: wc realize that several simplifications take place. For 
instance = if j = 1, 1 and in fact D can be replaced by Dj with Dj = D if j = and Dj = ^ if 
j > 0. It follows also that the trees with at least one inner branch with j\ = 1 give a vanishing contribution 
(corresponding to the fact that = as the angles a are isochronous, i.e. a = io even if /x ^ 0). The 

splitting being JgP^'j we see that the second bound in (8.2) holds uniformly in the size of J > Jo, i.e. the 
constants D, B, (3,p do not depend on J. This is again as in the KAM theory of §7. 

The first bound in (8.2) is more delicate, if J = +oo, as it is not "purely algebraic", resting on the KAM 

theory: if / = 2 one does not really need the KAM theory as mentioned in §1 and also the first boimd 
holds. If / > 2 the KAM theory does not apply, in general, and only the second estimate (8.2) is proved by 
the above arguments. This implies in particular the (not surprising) fact that perturbation theory for the 
whiskers and their splitting is well defined to all orders; but convergence is not guaranteed {i.e. the first of 
(8.2) may fail) and one cannot guarantee even the persistence of the invariant tori. 

To summarize (8.2) holds uniformly in J > Jo if Z = 2; if Z > 2 only the second of (8.2) holds uniformly in 

J > Jo, and it is not even sufficient to yield the convergence of the formal perturbation theory for the tori 
and the splitting. If I > 2 the first of (8.2) holds at fixed J < +oo: as needed and claimed in §8. 

A conjecture. 

Nevertheless we think that even if I > 2, and for all J < +oo, but with J > Jq, the t can be set equal to 0: 
in other words we conjecture that the above theory can be completely freed from its dependence on a KAM 
type of proof of existence of the invariant tori and whiskers (like the quoted [CG]), so that the two theories 
are essentially equivalent and, independent even if I > 2. We believe that the essential has been done already 
in the present paper. One would have to improve a little the cancellations analysis of §6. And the form of 
(1.1) will have to play an essential role (as it already did above). The main feature of (1.1) to keep, in order 
to be able to believe the above conjecture, is the absence of action variables in the perturbation /. 

Appendix A2: Symmetry of rootless trees. 

We first prove (6.12). Integrating the second of (6.11) over i?2 on circles with radii r\,r2 with < r2 < ri 
and ri small enough we get a l.h.s. equal to the l.h.s. of the relation (6.12); while the r.h.s. of (6.12) is 
obtained by taking < ri < r2, and r2 small enough. Hence we want to deform the radius ri to become 
larger than r2. This can be done if no singularities in i?2 are met near the product of two circles with 
radius r2. The evaluation of the double integral in (6.11) leads to a fmiction which can have some singular 
terms when i?i = ±i?2: it is easy to see, representing F,G as a. sum of finitely many terms like (3.2) plus 
very fastly decreasing functions at ±oo, that the singular terms have the form (i?i±i?2) n > 0. Therefore 
after multiplying them by R^^R^^ wc sec that they have equal doiible residue at 0, whether we take first 
the residue in i?i or in J?2 (and the result is zero in both cases). So that (6.12) follows. 

Given a tree t^o (with all its labels) we can consider the object obtained by deleting the root branch (with 
its labels). The object d thus obtained will be a rootless tree 'd. 

The same rootless tree can be obtained from several trees, as by deleting the root branch we forget the 
action label it beared. And we also forget which node was the first node. 

We can find the trees which generate the same rootless tree ?9 by attaching a root branch to any of the free 
nodes v £ d. Some restrictions may apply to the labels that can appear on the new root branch. In fact 
from the rules of tree labehng of §5, and fron the (2.14), (4.10) it appears that if the type (5„ of the node is 
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1 no restrictions apply; but if ^t, = the action label of the new branch has to be 0. 

A rootless tree with a distinguished node will be a pair of a rootless tree and one of its nodes. 

It is convenient to define the value V{^,v) of a rootless tree with a distinguished node: we shall use the 
following prescription. 

1) Imagine to attach a root branch at the node v with any possible action label. 

2) Evaluate the tree value with the rules of §5, but the node function introduced in (5.12) relative to the 
distinguished node will be modified by deleting the factor —^{i'v)jv and replacing it by 1. 

3) The 5j operation is always X (even when the attached root branch action label is = 0) which would 
require, for a normal tree, the interpretation fj(-) =X(woo'))- 

By construction Vip^ v) does not depend on the label jy that we attach to the new root to perform the 

above calculation. In principle, however, it does depend on the distinguished node v. The main consequence 
of the symmetry (6.12) is that in fact it is v independent. Thus we can denote it V^(^), calling it the value 
of the rootless tree '9. 

To prove the above independence we write the just defined value V{^, v) as: 

/+00 r r 

-°° p=l p=l 

if r is the number of branches arriving in v and hp is the order of the p-th branch (defined as the sum of the 
type labels of the nodes that can be reached leaving v along the p-th branch; the jp is the angle label of the 
p-th branch. The w{t,t') is either (t — r') if j\ > or 'Woq{t)wqi{t') — ■u;o((r)woo (''"') if ji = (apart from 
some proportionality constant fixing the dimensions). The X^^{t) is the result of the integrations over the 

time labels of the nodes of 'd which follow the p-th branch emerging from v (in the order generated on 'd by 
the insertion of a root branch). 

Let v' be one of the r nodes linked by a branch ol^tov. We suppose that the branch is the one corresponding 
to p = 1. Then, by the definition of the 7?.-tree evaluation we see that: 

p P°° q=l q=2 

if s is the number of branches arriving in v' and hq is the order of the q-th branch (defined as the sum of 
the type labels of the nodes that can be reached leaving v' along the q-th branch. The j'^ is the angle label 

of the q-th branch. 

Substituting (A2.2) into (A2.1) we get an expression like the l.h.s. of (6.12): then it is immediate to check 
that the expression to which it is equal, by applying (6.12), is in fact V{'d,v'). 

Finally one remarks that the value of a tree do with root label I + jy„ with j = > is expressed in terms 
of the corresponding rootless tree -d value as —\{ivvo)jV{j)). This would not be true for j = because the 
evaluation of the tree value would require using the operation J(woo') instead of X. 

If i^o, • • • ,1%! arc the trees that can be associated with a given rootless tree and which have a root 
branch action label I + j with j > 0, we see that the sum of their values is: 

~Y.^^v,)jV{d) = -'-{P,)iV{d) (^2.3) 

k 

where Vq is the total free mode of the trees (which is the same for all). 

Hence we see that if the total free mode vanishes the corresponding trees give a vanishing contribution to 
the homoclinic splitting. And the cancellation takes place separately among the trees that have the same 
rootless tree. In particular we can say that the family of all the trees with only ripe fruits and total free 
mode gives a zero contribution to the action splitting for j > 0, (of course the same could be said of the 
family of the trees with only dry fruits, or with a prefixed number of dry and ripe fruits and of free nodes). 



Appendix A3: Analysis of the approximate cancellations. 

We have seen in §7 the basic mechanism, given for the purpose of illustration, showing that, restricting the 
sum in (6.23) to a sum over the trees such that: 
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I) I'fiv) 7^ if w is any node (seeds included). 

II) i^fiv) 7^ i^f{v') for all pairs of comparable nodes v',v, (not necessarily next to each other in the tree order, 

however), with v' > va. 

leads to a convergent series. The trees not verifying II) are called resonant by Eliasson. And in this section 
we deal with them. Which is the hardest part of the problem and the most original contribution by Eliasson 
to the field. 

I shall consider only the case of trees without fruits, as the reduction, in §7, to this case was not based on 
assumption II). 

However there are resonant trees. The key remark is that they cancel almost exactly. The reason is very 
simple: imagine to detach from a tree i? the subtree with first node v. Then attach it to all the remaining 
nodes w > v' ,w e '&/'&2- We obtain a family of trees whose contributions to h^^^ differ because: 

1) some of the branches above v' have changed total momentum by the amount i^{v): this means that some 
of the denominators (w • v'{w))~'^ have become (w • iy{v) ± e)"^ if £ = • i'iv); and: 

2) because there is one of the node factors which changes by taking successively the values Vyjj, j being the 
branch label of the branch leading to v, and w G '&/'d2 is the node to which such branch is reattached. 

Hence if a; • ;7 = e = we would build in this resummation a quantity proportional to: '^i^w = ^{v) — 
which is zero, because i^{v) = u{v') means that the sum of the j^j's vanishes. Since uj ■ v = e ^ Q we can 
expect to see a sum of order e^, if we sum as well on a overall change of sign of the Vw values (which sum 



But this can be true only if £ <C ci; • r?', for any branch momentum V of a branch in 'd/'&2- If the latter 
property is not true this means that Co ■ V is small and that there are many nodes in ?9/t?2 of order of the 
amount needed to create a momentum with small divisors of order e. 

Examining carefully the proof of Brjiuno's lemma one sees that such extreme case would be essentially also 
treatable. Therefore the problem is to show that the two regimes just envisaged (and their "combinations") 
do exhaust all possibilities. 

Such problems are very common in renormalization theory and are called "overlapping divergences". Their 
systematic analysis is made through the renormalization group methods. We argue here that Elliasson's 
method can be interpreted in the same way. 

The above introduced trees will play the role of Feynman diagrams; and they will be plagued by overlapping 
divergences. They will therefore be collected into another family of graphs, that we shall call trees, on which 
the bounds are easy. The {uj ■ V)~'^ are the propagators, in our analogy. 

We fix an scaling parameter 7, which we take 7 = 2 for consistency with (7.1), and we also define w = CoWq: 
it is an adimensional frequency. Then we say that a propagator (cJ • z7)~^ is on scale n if 2"~^ <\uj- v\ < 2", 
for n < 0, and we set n = 1 if 1 < • 

Proceeding as in quantum field theory, see [G3], given a tree 1? we can attach a scale label to each branch 
v'v in (6.23) (with v' being the node preceding v): it is equal to n if n is the scale of the branch propagator. 
Note that the labels thus attached to a tree are uniquely determined by the tree: they will have only the 
function of helping to visualize the orders of magnitude of the various tree branches. 

Looking at such labels we identify the connected clusters T of nodes that are linked by a continuous path 
of branches with the same scale label ut or a higher one. We shall say that the cluster T has scale ut- 

Among the clusters we consider the ones with the property that there is only one tree branch entering them 
and only one exiting and both carry the same momentum. Here we use that the tree branches carry an 
arrow pointing to the root: this gives a meaning to the words "incoming" and "outgoing" . 

If V is one such cluster we denote Ay the incoming branch: the branch scale n = nxy is smaller than 
the smallest scale n' = ny of the branches inside V. We call wi the node into which the branch Xy ends, 
inside V. We say that such aV is a resonance if the number of branches contained inVis< E2~^^ , where 
n = n\y, and E,e are defined by: E = 2~^^N~^, e = t~^. We shall say that nxy is the resonance scale. 

Let us consider a tree 1? and its clusters. We wish to estimate the number Nn of branches with scale n < 
in it, assuming 7V„ > 0. 

Denoting T a cluster of scale n let TOt be the number of resonances of scale n contained in T [i.e. with 
incoming branches of scale n), we have the following inequality, valid for any tree 



up to 0). 




+ 




T, nT=n 



with E = N-^2-^^,e 



T ^. This is a version of Brjuno's lemma: a proof is in appendix A4. 
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Consider a tree we define the family ^(^?^) generated by i?^ as follows. Given a resonance V of "3^ we 

detach the part of i?^ above Ay and attach it successively to the points w G V, where V is the set of nodes 
of V (including the endpoint wi of Ay contained in V) outside the resonances contained in V. Note that all 
the branches A in F have the same scale n\ = nv- 

For each resonance V of -d^ we shall call My the number of nodes in V. To the just defined set of trees we 
add the trees obtained by reversing simoultaneously the signs of the node modes Uw, for w €V: the change 
of sign is performed independently for the various resonant clusters. This defines a family of Yl 2My trees 
that we call J^{-di). The number f] 2My will be bounded by exp^] 2My < e^'^. 

It is important to note that the definition of resonance is such that the above operation (of shift of the node 
to which the branch entering V is attached) does not change too much the scales of the tree branches inside 
the resonances: the reason is simply that inside a resonance of scale n the number of branches is not very 
large being <Nn = E 2-"^. 

Let A be a branch, in a cluster T, contained inside the resonances V = Vi C V2 C . . . oi scales n = m > 
n2 > . . .; then the shifting of the branches Ay^ can cause at most a change in the size of the propagator of A 
by at most 2"! + 2"2 + . . . < 2"+^ _ 

Since the number of branches inside V is smaller than Nn the quantity cj-i'x of A has the form oj-i^+axCJ-i'Xy 
if is the momentum of the branch A "inside the resonance V, i.e. it is the sum of all the node modes of 
the nodes preceding A in the sense of the branch arrows, but contained in V; and ax = 0, ±1. 

Therefore not only |w • > 2"+^ (because is a, sum of < Nn node modes, so that < NNn) but 
Lo ■ fy'l is "in the middle" of the diadic interval containing it and by (7.1) does not get out of it if we add a 
quantity bounded by 2"+^ (like axco ■ vxv)- Hence no branch changes scale as ^ varies in J^{"&^), if ujq verifies 



This implies, by the strong diophantine hypothesis on luq, (7.1), that the resonant clusters of the trees in 
J^{'&^) all contain the same sets of branches, and the same branches go in or out of each resonance (although 
they are attached, to generally distinct nodes inside the resonances: the identity of the branches is here defined 
by the number label that each of them carries in ). Furthermore the resonance scales and the scales of the 
resonant clusters, and of all the branches, do not change. 

Let 1?^ be a tree not in T{'d^) and construct etc. We define a collection {.F(i?')},=i,2,... of pairwise 

disjoint families of trees. Wc shall sum all the contributions to fv^^ coming from the individual members of 
each family. This is the Eliasson's resummation. 

We call ev the quantity w • vxv associated with the resonance V. If A is a branch with both extremes in 

V we can imagine to write the quantity uj ■ i^x as • + (Jyey, with cry = 0, ±1. Since jcD • ;7x| > 2"^~^ 
we see that the product of the propagators is holomorphic in ey for |£y | < 2"^""^. While ey varies in such 
complex disk the quantity |a; • Vx] does not become smaller than 2"^~^ — 2"^~^ > 2""^"^. Note the main 
point here: the quantity 2"^~^ will usually be ^ 2'^^v~'^ which is the value ey actually can reach in every 
tree in J^(d^); this can be exploited in applying the maximum priciple, as done below. 
It follows that, calling nx the scale of the branch A in 1?^, each of the Y\ 2My < e^'^ products of propagators 
of the members of the family can be bounded above by 2"^'"^^^^ = 2^'^ 2^^"^, if regarded as 

a function of the quantities ey = w • Vxy, for |ey| < 2"^"'^, associated with the resonant clusters V. This 
even holds if the ey are regarded as independent complex parameters. 

By construction it is clear that the sum of the ]~[ 2My < e^^ terms, giving the contribution to TS''^ from the 
trees in JF('(?^), vanishes to second order in the ey parameters (by the approximate cancellation discussed 
above). Hence by the maximum principle, and recalling that each of the scalar products in (6.23) can be 
bounded by N"^, we can bound the contribution from the family T{'d^) by: 



where: 

1) Nn is the number of propagators of scale n in (n = 1 does not appear as |a; • z7| > 1 in such cases), 

2) the first square bracket is the bound on the product of individual elements in the family T{'d^) times the 
bound e^^ on their number, 

3) The second term is the part coming from the maximum principle, applied to bound the resummations, 
and is explained as follows. 

i) the dependence on the variables ey. = £i relative to resonances Vi dT with scale uxy = nis holomorphic 
for for |ei| < 2"'^'^ if Ui = nvi, provided rij > n + 3 (see above). 



(7.1). 
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ii) the resummation says that the dependence on the Si's has a second order zero in each. Hence the maximum 
principle tells us that we can improve the bound given by the first factor in (A3. 2) by the product of factors 
(|£i| 2""'+"^)^ if Ui > n + 3. If rij < n + 3 we cannot gain anything: but since the contribution to the bound 
from such terms in (A3. 2) is > 1 we can leave them in it to simplify the notation, (of course this means that 

the gain factor can be important only when <C 1). 

Hence substituting (A3.1) into (A3. 2) we see that the rriT is taken away by the first factor in 2^"2~^"', 
while the remaining 2~^"* are compensated by the —1 before the +mT in (A3.1) taken from the factors 
with T = Vi, (note that there are always enough — I's). 

Hence the product (A3. 2) is bounded by: 

(C^ Jq- VoA^^)*'e2*=24'=26'= JJ 2"^"'=^"' < (yl3.3) 

n 

with: Bo = 2iOe2c2/oA^'exp[iV(22+2--^ log 2) T.7=iP'^-'^~\ 

To sum over the trees wc note that, fixed the collection of clusters is fixed. Therefore we only have to 
multiply (A3. 3) by the number of tree shapes for (< 2^*^^!), by the number of ways of attaching mode 
labels, (< (3A^)"°), so that we can bound \h^^^ \ by an exponential of k and (1.4) follows. 

Remarks. 

The strong diophantine condition is quite unpleasant as it seems to put an extra requirement on wq: I think 
that in fact such condition is not necessary: I explain why in [G3]. The basic reason is that one is not forced 
to introduce the scales 2" in an exact geometric growth: an approximate one, in which 2" is replaced by 7„ 
and 1 < 7„2~" < 2 would be enough. This gives much more freedom in fulfilling the (7.1), given ujq verifying 
(1.3) only. Such a squence can be shown to exist always if ujq verifies (1.3), possibly replacing Co by a larger 
constant (to fulfill the analogue of (7.1)). 



Appendix A4: Resonant Siegel-Brjuno bound. 

Calling iV* the number of non resonant lines carrying a scale label < n. We shall prove first that < 
2fc(^2-^")-i - 1 if AT* > 0. 

If I? has the root line with scale > n then calling '!9i,?92, • • • ,'&rn the subdiagrams of emerging from the 
first vertex of ?9 and with kj > E2-^"' lines, it is N*{-d) = N*{-di) + ... + N*('&m) and the statement is 
inductively implied from its validity for k' < k provided it is true that N*{'d) = if A; < £^2~^", which is is 
certainly the case if E is chosen as in (A3.1).* 

In the other case it is A^* < 1 + J^-^i -^*(i^j)) and if m = the statement is trivial, or if m > 2 the 
statement is again inductively implied by its validity for k' < k. 

If TO = 1 we once more have a trivial case unless the order fci of t?i is fci > fc — ^E2~"^. Finally, and this 
is the real problem as the analysis of a few examples shows, we claim that in the latter case the root line is 
either a resonance or it has scale > n. 

Accepting the last statement it will be: Af*(t?) = 1 + N*{'di) = 1 + N* + . . . + N*{'&'^,), with being 
the to' subdiagrams emerging from the first node of with orders kj > E2~'^^: this is so because the root 
line of I?! will not contribute its unit to N*{-di). Going once more through the analysis the only non trivial 
case is if to' = 1 and in that case N{'&[) = N*{'&"i) + . . . + N{'&"m"), etc, until we reach a trivial case or a 
diagram of order < k — ^E2~"^. 

It remains to check that if fci > A; — ^E2~'^^ then the root line of i?i has scale > n, unless it is entering a 

resonance. 

Suppose that the root line of i?i is not entering a resonance. Note that \uj ■ i'{vo)\ < 2", \uj ■ z7(u)| < 2", 
if vo,vi are the first vertices of i? and respectively. Hence S = \{lo ■ (j7(fo) — i^ivi))] < 2 2" and the 
diophantine assumption implies that |i7(i'o) ~ '^(^'i)! > (2 2")""^ , or ^{vo) = I'ivi). The latter case being 

* Note that if A; < E2-"^ it is, for all momenta u of the lines, \u\ < NE2-"^, i.e. \uj ■ u\ > {NE2-"^)-'^ = 2» 2" so that there 
axe no clusters T with nx = n and N* = 0. The choice E = N~^2~^^ is convenient: but this, as well as the whole lemma, 
remains true if 3 is replaced by any number larger than 1. The choice of 3 is made only to simplify some of the arguments 
based on the resonance concept. 
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discarded as A; — fci < 2~"^ (and we are not considering the resonances), it follows that k — k\< 2~"^ 
is inconsistent: it would in fact imply that i'(uo) — is a sum of A: — fci vertex modes and therefore 

117(110) — v{vi)\ < ^NE2~"^ hence (5 > 2^ 2" which is contradictory with the above opposite inequality. 
A similar, far easier, induction can be used to prove that if N* > then the number p of clusters of scale 

n verifies the bound p < 2k {E2~'^")~^ — 1. Thus (11) is proved. 

Remark: the above argument is a minor adaptation of Brjiuno's proof of Siegel's theorem, as remarkably 
exposed by Poschel, [P] . 
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